Add capability for inputting full ForenSeq sequences (not run through UAS) #16

rnmitchell · 2020-04-23T14:18:33Z

Adding capability to input full ForenSeq sequences which have not been run through the UAS (#14).

This requires:

Adding CLI (--uas flag) for indicating if sequences have been run through the UAS for the lusstr annotate command
Adding the number of bases to remove from the 5' and 3' ends of the full sequence to the str_markers.json file in order to produce the UAS region of the sequence.
Incorporating code into the annot.py script to remove the 5' and 3' bases to create the sequence of the UAS region (for further annotation).
Create bracketed annotation form of the full sequence.

rnmitchell · 2020-04-23T14:31:34Z

Oops... didn't mean to try to merge that...

rnmitchell · 2020-04-24T15:17:13Z

We can probably merge this branch into the master, @standage. The other things I want to do are a bit more complicated and will require more time.

standage

Overall looks good, pending some minor comments.

lusSTR/str_markers.json

lusSTR/annot.py

standage · 2020-04-24T15:46:58Z

I'll merge when the remaining comments are resolved.

rnmitchell · 2020-04-24T15:58:03Z

Ok, @standage, I think now it's ready! :)

lusSTR/annot.py

rnmitchell · 2020-04-27T20:05:46Z

I'm having issues with writing the code for this test and thought maybe @standage you could solve it in 5 seconds... When I run the test it's cutting off the header of the file it's creating (and so the test is failing)... thoughts? If I remove the header from the comparison file, the test passes. It's driving me crazy.

rnmitchell · 2020-04-28T12:36:10Z

Ok, so I'm seeing that the NamedTemporaryFile() function creates the file immediately (thus lusSTR sees the file exists and doesn't add the header). Is there another way to create a temp file? Or should I just remove the header of the test file?

…koverflow.com/a/20670757/459780

standage · 2020-04-28T15:36:20Z

I was hoping there was a way to tell NamedTemporaryFile to just return a filename without creating the file. There is a mktemp() function, but it is deprecated due to its risk for race conditions.

So the simplest solution I could come up with, silly as it sounds, is to delete the file before running lusSTR annotate.

standage · 2020-04-28T15:43:44Z

The test suite is passing now, w00t!

Do we have any data sets that we can run without the --uas flag, so that the resolve_uas_sequence() code gets invoked?

rnmitchell · 2020-04-28T17:08:49Z

yes! Just added the test and all pass for me.

standage and others added 2 commits April 2, 2020 12:05

Make sure all files are bundled correctly

dd4f54c

fixed style issues

40b4f8b

Rebecca Mitchell added 4 commits April 23, 2020 14:35

Added full length sequence capability

fc444f7

Added PowerSeq cut points

4040bd2

Added PowerSeq option

34b18c7

added test

066c45e

standage changed the title ~~WIP: Add capability for inputting full ForenSeq sequences (not run through UAS)~~ Add capability for inputting full ForenSeq sequences (not run through UAS) Apr 24, 2020

Merge branch 'master' into fullsequences

6b022eb

standage reviewed Apr 24, 2020

View reviewed changes

lusSTR/str_markers.json Outdated Show resolved Hide resolved

lusSTR/annot.py Outdated Show resolved Hide resolved

lusSTR/annot.py Outdated Show resolved Hide resolved

lusSTR/annot.py Outdated Show resolved Hide resolved

Clean up conditional

22dca2d

minor fixes

9078590

standage reviewed Apr 24, 2020

View reviewed changes

lusSTR/annot.py Outdated Show resolved Hide resolved

standage and others added 5 commits April 24, 2020 12:35

Refactor trimming

197cccc

More refactoring

cc5f28a

Clean up makefile

7060c4c

Fix CI config

567c340

added UAS annotate test

10e4689

revised test

556ec5c

standage added 2 commits April 28, 2020 11:32

Fix test by deleting output file before lusstr annotate writes to it

6e7fa72

Rename variable to avoid conflicts with builtin function https://stac…

3558926

…koverflow.com/a/20670757/459780

added full seq test

37ce9c2

standage approved these changes Apr 28, 2020

View reviewed changes

standage merged commit 4037b13 into master Apr 28, 2020

standage deleted the fullsequences branch April 28, 2020 17:32

This was referenced May 4, 2020

Add PowerSeq sequences #15

Closed

Add ability to input full sequences #14

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add capability for inputting full ForenSeq sequences (not run through UAS) #16

Add capability for inputting full ForenSeq sequences (not run through UAS) #16

rnmitchell commented Apr 23, 2020

rnmitchell commented Apr 23, 2020

rnmitchell commented Apr 24, 2020

standage left a comment

standage commented Apr 24, 2020

rnmitchell commented Apr 24, 2020

rnmitchell commented Apr 27, 2020

rnmitchell commented Apr 28, 2020

standage commented Apr 28, 2020

standage commented Apr 28, 2020

rnmitchell commented Apr 28, 2020

Add capability for inputting full ForenSeq sequences (not run through UAS) #16

Add capability for inputting full ForenSeq sequences (not run through UAS) #16

Conversation

rnmitchell commented Apr 23, 2020

rnmitchell commented Apr 23, 2020

rnmitchell commented Apr 24, 2020

standage left a comment

Choose a reason for hiding this comment

standage commented Apr 24, 2020

rnmitchell commented Apr 24, 2020

rnmitchell commented Apr 27, 2020

rnmitchell commented Apr 28, 2020

standage commented Apr 28, 2020

standage commented Apr 28, 2020

rnmitchell commented Apr 28, 2020