push stats on demand #91

adRn-s · 2024-09-13T12:12:15Z

Adressing #89

Not tested.

WardDeb

I left two specific comments in the code, but have two more general points:

How I'd understand it:
One runs BRB --stats --fid path/to/flowcell
-> don't run BRB.PushButton.GetResults, only phoneHome or telegraphHome where appropriate
One runs BRB without --stats / --fid and there is an unprocessed flow cell
-> run workflow as it's implemented now (GetResults workflow)
One runs BRB without --stats / --fid and there is no unprocessed flow cell
-> sleep

I only see a query to parkour under the stats is not None arm, am I misunderstanding the implementation maybe ?

Secondly, It should be easy to test a stats/fid implementation by comparing the json to the one pushed to parkour previously, it's probably safer to test this before just pushing these changes over.

BRB/run.py

adRn-s · 2024-09-17T12:46:53Z

You're not misunderstanding the implementation. So, yes. BRB without --stats, should be the original.

The idea, is that this one is there, left running as always. On top of it, no matter what that process is doing (if there's a new flowcell, or not, and it's just sleeping) we could run BRB with --stats and a FCID, to push those stats to Parkour.

Secondly, It should be easy to test a stats/fid implementation by comparing the json to the one pushed to parkour previously, it's probably safer to test this before just pushing these changes over.

I will try that! 👍🏽

adRn-s · 2024-09-18T09:40:02Z

The newly added functionality works. I didn't compare the json file(s) pushed to Parkour, but saw the emails in my inbox and they look coherent, and since we didn't change how the info is parsed or anything else; it should all be set.

The only thing left, perhaps, is testing that the old functionality is there without any new issues. It should be the case, but still.

As a side note, I have modified the "external data" print statement in PushButton because I was puzzled by it. I can't say that it's anything clearer now... For example, BigRedButton -c /home/pipegrp/adRn/brb.ini --stats --fcid="BH7WMYDRX5" outputs:

Logging into: /home/pipegrp/adRn/logs/brb/240715_A00931_0732_BH7WMYDRX5_lanes_1_2.log
GetResults with ignore=True, 3211_Holec_Hilgers is external data.
GetResults with ignore=True, 3237_Tripathy_Akhtar is external data.
GetResults with ignore=True, 3241_Förtsch_Akhtar is external data.

BRB/run.py

WardDeb · 2024-09-19T07:02:10Z

BRB/run.py

+
+
+        # Process each group's data, ignore cases where the project isn't in the lanes being processed
+        process_data(config, ParkourDict)


I still fail to understand the logic here, sorry..

stats not set -> only set a log file
stats set -> infer some lane status and set some other paths ?

Afterwards all the data gets processed anyway. For me this is effectively the same as running BRB twice ( with the added disadvantage that we are hardcoded for two lanes now).

If you don't have FID as an argument, but an actual path to a processed flowcell (which might be split per lane or not, but at least you don't have to infer it here), and subsequently have:
if not stats:
process_data
else:
if analysis was actually done already (because analysis worked, parkour was just dead) -> run phoneHome
else no analysis was actually done (because non-std libtype, or external data) -> run telegraphHome

WardDeb · 2024-09-19T07:03:30Z

Logging into: /home/pipegrp/adRn/logs/brb/240715_A00931_0732_BH7WMYDRX5_lanes_1_2.log
GetResults with ignore=True, 3211_Holec_Hilgers is external data.
GetResults with ignore=True, 3237_Tripathy_Akhtar is external data.
GetResults with ignore=True, 3241_Förtsch_Akhtar is external data.

My guess is the actual path constructed from the flow cells is wrong (does lanes_1_2 even exist, or we only have lanes_1 and lanes_2 as separate dirs). Perhaps the other comment clarifies some things, otherwise we could discuss this in person.

first draft

a78704a

adRn-s requested a review from WardDeb September 13, 2024 12:14

adRn-s added 4 commits September 13, 2024 14:16

fix function not found

3962727

fix function not found (1)

f2c9bea

use python convenient idiom

dc90f84

missing import

c38951a

WardDeb reviewed Sep 17, 2024

View reviewed changes

BRB/run.py Outdated Show resolved Hide resolved

BRB/run.py Show resolved Hide resolved

draft inter-relationship

4149a26

Works.

eab9ade

WardDeb reviewed Sep 19, 2024

View reviewed changes

BRB/run.py Show resolved Hide resolved

WardDeb reviewed Sep 19, 2024

View reviewed changes

Pipeline Project User and others added 2 commits September 19, 2024 12:24

avoid confusing 1_2 in log filename for dual_lane

a3f24c2

tidy-up

c758568

adRn-s force-pushed the stats branch from f4ea667 to c758568 Compare September 19, 2024 11:43

🌙 🚲

8570fbc

adRn-s force-pushed the stats branch from a375c44 to 8570fbc Compare September 19, 2024 12:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

push stats on demand #91

push stats on demand #91

adRn-s commented Sep 13, 2024

WardDeb left a comment

adRn-s commented Sep 17, 2024

adRn-s commented Sep 18, 2024

WardDeb Sep 19, 2024

WardDeb commented Sep 19, 2024



		# Process each group's data, ignore cases where the project isn't in the lanes being processed
		process_data(config, ParkourDict)

push stats on demand #91

Are you sure you want to change the base?

push stats on demand #91

Conversation

adRn-s commented Sep 13, 2024

WardDeb left a comment

Choose a reason for hiding this comment

adRn-s commented Sep 17, 2024

adRn-s commented Sep 18, 2024

WardDeb Sep 19, 2024

Choose a reason for hiding this comment

WardDeb commented Sep 19, 2024