Monarch geneset OGS2.0

DPOGS206863
Transcript: DPOGS206863-TA (2934 bp)
Protein: DPOGS206863-PA (977 aa)
Genomic position: DPSCF300001 - 2559218-2565876
RNAseq coverage: 260x (Rank: top 41%)

Annotation
Heliconius: HMEL014046 | 0.0 | 53.49%
Bombyx: BGIBMGA012814-TA | 4e-176 | 59.05%
Drosophila: su(w[a])-PA | 1e-75 | 36.32%
EBI UniRef50: UniRef50_UPI0002064551 | 5e-98 | 41.51% | UPI0002064551 related cluster n=3 Tax=unknown RepID=UPI0002064551
NCBI RefSeq: XP_001603801.1 | 5e-85 | 54.63% | PREDICTED: similar to GA15714-PA [Nasonia vitripennis]
NCBI nr blastp: gi|350413632 | 3e-98 | 42.77% | PREDICTED: hypothetical protein LOC100749326 [Bombus impatiens]
NCBI nr blastx: gi|350413632 | 2e-107 | 32.06% | PREDICTED: hypothetical protein LOC100749326 [Bombus impatiens]

Group
Gene Ontology: GO:0006396 | 4.3e-16 | RNA processing
               GO:0003723 | 4.3e-16 | RNA binding
KEGG pathway:
InterPro domain: [27-143] IPR019147 | 4.1e-30 | Splicing factor, suppressor of white apricot
                 [182-235] IPR000061 | 4.3e-16 | SWAP/Surp
Orthology group: MCL17043 | Insect specific

Nucleotide sequence:

>DPOGS206863-TA
ATGTCTCTGAAATGGACTGGAAATCACAATGAAACTGGTATACTAAGAAAGAGTGATGCAAGCAGAGAGAAGAAAGAAGAGTTATTCGTGTTCGGGTATTCGTGCAAACTGTTCAGGGATGATGATAAAGCTCTGCACATTGACCAAGGAAAGCATCTCATTCCGTGGATGGGGGATGAAACTTTGAAAATCGACAGGTATGACGCAAGAGGGGCCCTCCATGATTTGGTGTCCTTAGAAGCTCCACCTGGTGGCTTCGACTGGCGTGTGGAGCTTTCCAGATCTGAACAGGATGTGGAACAGCTCTGTGATGAGGAGAGATACCGGGCCCTACATACTGATGAAGATGAGGAAGAAATGTATAAGGAGGAAGAGCTTAAACGGCTCCATGCAGCAGGTTATGGTCAAGTCGGCTTCAACTATGACGCTCCAGCTGAAACACCTCCTGAACCTCCAGTAGAAATTGAAGAGCCCTTTGAACCTACAGCTTCATTTAAAGAACTGCTGCCTTCAAATACAGAATTTCCTCCAACACAGAAGCAAAATGCAATTATTGAAAAGACAGCCAAGTTTATAGCACATCAAGGTACCCAAATGGAAATTCTTATTAAGGCCAAACAGGGTGACAATCCCCAATTTCAATTTCTCAACAAAGATTCATCACTACATCCCTACTACACGACCCTTATAGCATTGGTCAAAGCTGGAAAGTGGCCGGAGAAGGCAGAGGTTGTTGAAGAAAAACATGAAACAAATGAGGAGTACCTTCATCCAAGCTTAGCATCAACTGTTATAGAATCGGCTCCTTCAATACCGAGCATTCACTACAAGCCATCTGCAGATTGTGACTATACTTTACTGATATCAAAAATGAGGGGCGAGACATTGGATGAGTACTCCGACCTGGCGCCTGGCGAGGTGGCTCCCCCGGGAACTGAACCTGTGCTACCAAGGGCTGATATCATGAAGGCACCGGTCATGTACAATAGAGGAGAACCAGTTCCACCGATTCCTCAAACTGTGCAACATCATCAGTATGCAGCGTACTACACACAGTATATGGCTCATCTAACACAAGCACAGACACAAGCACACACGCCACAGACGACAGTTGAAAAACCAGCACCGACCATATCTCCAACGGAATCCACCGGTTTAAGTCTTATGAAGAACTACAACACGGACAGTGAAAGCGAAGAATCAGAATTAGAAGAGAGTTCTAAGCAAGGAAGCAAAGACAATGTGTTAGTTCCACCGAACGATATTCATGTGGTCATAGACAAGATGGCGGCATACGTAGCGCGGAACGGTGACGAGTTCGAAAAAATTGTTCGGTCTAAAAACGATCCGAGATTCACATTCTTAGATGACAGTAATATATATCATCCATACTACAAGAAATTGATGCTTGAGAAAAGAGGTGTCCCAAATGGAAAGGATAAAAACGAAATTGATAAAGCAATCCCCGTGTCGTTTTCAATAAAGAAACATAAAGAACCGGAACCGATTCTCCCGAAGCCAGCACTTCCATACGAGTCGAGCACAGATGGTGAGGAAGAAAATAAAGAAGCCGATAAACGTGAAGCACCTAAGGAAGTGCCAGTCACTAATATACCGCAGACACATCTCACTAACAACAGTAACAGTATTCCGCCTATTGTTGTATACAAAATGGAAGCTGTACAGGCCAACAATCTGCCTTTAGTGAAAACATATGAACCTTGCAACATTGGCAAAGCAGTTACTGAAGTCAGTGAGAATCCTAAGGAAGCGGTTGTTCCTCAAGTAGTTGAAAAAATTACAGAAAATGTCTTACAGCCCCCTGCTGAAATTAAAAAGGAGAATTCGCCGGAAAAAAAAGTTGTCGAACGGGCCAAAGACAAAGATAAGAGTCCCAGGGATAAGAGTCCTAAAGAAAGGAGTCCTAAAGAAAGGAGTCCTAAAGAAAAGAGTCCTAAAGATAAGAGCCCTAAAGATAAGAGTCCTAGGGATAAAAGTCC
TAGAGACAGGAGCCCTAGAGATAAAAGGAAATCTAGAGATCGAAAAAAGGATTATAGATCAAGAAATGATAGGGAATCGAGAAGATCCGATCGGAGAAATGATGACAGAGACAAAGAACGCGAGAGAAAGAGAGACAGAGAGAGTGACAGGGATGGAAAAAGAAAGAAATATAAAGACGCCATAGAAACAGAAATTATATCATTAGAAGATAATTCTGATGAAATGATTGATTTGACCGGCGAGCAATCCGATTCCAGAGGCGAAGAGACGGAGGCGGATCGTTGCAAGCAGCAGCAGCGTCGTCGCCGTGCGGCGGAGTTCCTCCGACGTGTGTCCTCCTCCCGCACGAGGACTAGCCACGGGACCGACCGCAACCCTCGCCCACCGACCGCTACTCTACCGCACTCCTCACTAGCCAGCGCCATGGTTGACACATTAGAATCATTGTACAAAAAGAAAAACGAGGAAGACGAAAAGAAGAAGCGAAGAGAGAAACGACGGCAACGAGATAAAAGAGATTATGAAGAAGAATCCGACAGATACAAGAAAAACAAAAGAAGGAAGAATAGGTCTTCAGAAGAGGAAGACTCAGATGGACCGGGTTCCAAAAAGAAGAAGAGAAGGAAAGAGAAAAGCCATTCCTCTAAAAGCCAGAAAAAACCGAGAGACACAGAGATAGGTGAAAAGCCACAGCAAATTAACATCGATATAACGAACACGCTCAAGGAGCTAAGGAACTCATCCCCCACAAAAGAACTAAGATTAAGAGAGGAGAAGCTTTTAATAAAAGATAATTCCGATGGAGAGAAAAGTATGAAGAGCATAAAGAGAGATAGAGAGTACAGCGAGGGAGAGTGGTCCAGCGATAGTAATAATGACTCCGGCTTAAGTGACAACAATGCGGAACAAACCGTAGCTGGGAAATCAAATTAA

Protein sequence:

>DPOGS206863-PA
MSLKWTGNHNETGILRKSDASREKKEELFVFGYSCKLFRDDDKALHIDQGKHLIPWMGDETLKIDRYDARGALHDLVSLEAPPGGFDWRVELSRSEQDVEQLCDEERYRALHTDEDEEEMYKEEELKRLHAAGYGQVGFNYDAPAETPPEPPVEIEEPFEPTASFKELLPSNTEFPPTQKQNAIIEKTAKFIAHQGTQMEILIKAKQGDNPQFQFLNKDSSLHPYYTTLIALVKAGKWPEKAEVVEEKHETNEEYLHPSLASTVIESAPSIPSIHYKPSADCDYTLLISKMRGETLDEYSDLAPGEVAPPGTEPVLPRADIMKAPVMYNRGEPVPPIPQTVQHHQYAAYYTQYMAHLTQAQTQAHTPQTTVEKPAPTISPTESTGLSLMKNYNTDSESEESELEESSKQGSKDNVLVPPNDIHVVIDKMAAYVARNGDEFEKIVRSKNDPRFTFLDDSNIYHPYYKKLMLEKRGVPNGKDKNEIDKAIPVSFSIKKHKEPEPILPKPALPYESSTDGEEENKEADKREAPKEVPVTNIPQTHLTNNSNSIPPIVVYKMEAVQANNLPLVKTYEPCNIGKAVTEVSENPKEAVVPQVVEKITENVLQPPAEIKKENSPEKKVVERAKDKDKSPRDKSPKERSPKERSPKEKSPKDKSPKDKSPRDKSPRDRSPRDKRKSRDRKKDYRSRNDRESRRSDRRNDDRDKERERKRDRESDRDGKRKKYKDAIETEIISLEDNSDEMIDLTGEQSDSRGEETEADRCKQQQRRRRAAEFLRRVSSSRTRTSHGTDRNPRPPTATLPHSSLASAMVDTLESLYKKKNEEDEKKKRREKRRQRDKRDYEEESDRYKKNKRRKNRSSEEEDSDGPGSKKKKRRKEKSHSSKSQKKPRDTEIGEKPQQINIDITNTLKELRNSSPTKELRLREEKLLIKDNSDGEKSMKSIKRDREYSEGEWSSDSNNDSGLSDNNAEQTVAGKSN-
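
The reported lengths agree with each other if the transcript is read as a complete coding sequence: 977 codons plus one stop codon give 2934 bp. The sketch below shows that check in Python. It assumes the transcript contains no untranslated regions and that the trailing "-" in the protein record marks the stop codon; the function names are hypothetical and are not part of any geneset tooling.

```python
def protein_length(fasta_seq: str) -> int:
    """Count residues in a protein sequence string.

    Whitespace and line breaks are ignored, and a trailing stop
    marker ('-' or '*') is not counted as a residue.
    """
    seq = "".join(fasta_seq.split())
    return len(seq.rstrip("-*"))


def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if transcript_bp equals 3 * (protein_aa + 1).

    The '+ 1' accounts for the stop codon, under the assumption
    that the transcript is CDS only (no UTRs).
    """
    return transcript_bp == 3 * (protein_aa + 1)


# Values taken from the record above: 2934 bp transcript, 977 aa protein.
print(cds_length_matches(2934, 977))
```

To run the same check on the sequences themselves, pass the protein string from the FASTA record to `protein_length` and compare against the length of the nucleotide string with line breaks removed.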