Monarch geneset OGS2.0

DPOGS215939
Transcript: DPOGS215939-TA, 1767 bp
Protein: DPOGS215939-PA, 588 aa
Genomic position: DPSCF300308 - 13043-17763
RNAseq coverage: 263x (Rank: top 41%)
Annotation
Heliconius: HMEL007683; 4e-151; 89.82%
Bombyx: BGIBMGA001870-TA; 7e-150; 68.47%
Drosophila: Srp54-PA; 5e-96; 63.38%
EBI UniRef50: UniRef50_E2BXG3; 4e-102; 62.50%; Probable splicing factor, arginine/serine-rich 7 n=3 Tax=Endopterygota RepID=E2BXG3_HARSA
NCBI RefSeq: XP_001605226.1; 8e-111; 67.36%; PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|307177851; 2e-111; 66.20%; Probable splicing factor, arginine/serine-rich 7 [Camponotus floridanus]
NCBI nr blastx: gi|193704514; 2e-144; 49.69%; PREDICTED: hypothetical protein LOC100161931 [Acyrthosiphon pisum]
Group
Gene Ontology: GO:0000166; 7.6e-08; nucleotide binding
               GO:0003676; 6.2e-06; nucleic acid binding
KEGG pathway:
InterPro domain: [175-247] IPR012677; 7.6e-08; Nucleotide-binding, alpha-beta plait
                 [174-233] IPR000504; 6.2e-06; RNA recognition motif domain
Orthology group: MCL13187; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215939-TA
ATGGTTTCAAGCAGTACGAGGGTGATTCAAGTCACCAACATCGCCCCTCAAGCTACAAAAGACCAAATGCAAACCTTATTTGGTTATTTAGGAAAAATTGATGATATAAGATTGTACCCAACAATAAGAGACGTATCATGTCCGGTACAGTCTCGTATATGTTACGTGAAATATTATGATTCGGCGACAGTCAATGTCGCCCAGCATATGACAAATACGGTGTTTATAGATCGTGCTTTAATCGTGATTCCCATGCAGTCAGGAGAGATTCCTGACGAGCACCGAGCTCTAGAGATGTCAAGCAACGGAACTTTAGTGCCGGGTCTTAGTACAGTTGAACCACGATTACCAGCTCACGTGATCAACACTTTGGAAGGCGCACCACCCAACCAGGTCATTCAAACATACGATCCTAACATAGCAGCAGCGGGATTACCACCGTACCCGCCGCTTCCAGCCATTTATGACTCGAGGAAAATAGAAGAAATAAGAAGAACACTTTTACTTATAGATGTGGGCGAACTAACATCCCAACAACTCATTGATCATTTTTGTCAAGCTGGCGAAGTCAGCTACGTGCGATTTTGTGAACGGGAAGTTGACAACTTAAAGTATGCGCTGATAGAAATGACAGAACAAGAAAGCATATCAAAGGCTCTTCAGCTTAATGGAGTCGCATTAAATGGCCAAGTCATTAAGGTCCATCATTCTACGGTGGCTATATCAAAGCCTCAGGCTAAGAGCAATGAGGCAGCTCAAAGGGAGATCGAAGAGGCCATGTGCAGAGTTAAGGAAGCCCAGAACTTGATATCGGCTGCCATCGACCCCGTTATTGGATTGTTGTCTAAAGACAAAAGGACTCGTTCCCGGTCCCGGTCCCGCCGCCGCTCCCGGTCCCGGTCTCGTCGTTCCCGGTCCCGTCACCGCTCTAAGCGATCCCGCTCACGGTCCAGACATCGCTCCCGGAGATCACGCTCGAGGCACCGACACCGCACTAGGTCTCGTTCCCGTCACCGAAGCTCACGGCGCTCCAGGTCCAGATCCAGACACCGCAGCTCGAGATCCAAACGAGAGAAGTCGAAAGAACGCGATAGAAAAGACAAGAAAGACATCGGTGATAAGGAAAAGAGAGACAGTGATAAGACGAAGTCACCGCAGAAAGACGTGGGTAGGGACGGAAAAGACGAGCTCAAGATTGACATCAGCGAGGTTGATACGAACGGCAGCTCGTATGAACATAAATCTAAAGCCTCCACACCCGCTGATGATAAAGAAAAGACGACAGAGCTCGACAAGGACAAGTCGCCGAGGAAAAAGGAAAGGTCCCGCTCCAAGGAAAGGAAGAGGGAACGGTCGCGGTCGAAACGAAGGTCGCGGTCACGATCAAGAAGGAAACGCTCGAGGTCACGTAAAAGATCGAGGTCCAGGGACAGAAAGAAATCCCGCTCCAGAGAGAGGAAGAAGTCGAGGTCGCGGGACAGAAAACGGTCCAGGTCCAGGGACAGGAAGCGGACGAAGTCGAGGGAGAGGAAGAGGTCGCGGTCCAAAGATAGGAAAAGATCGCGCTCCAAGGATAGGAAGCGTTCGCGGTCACCCAGCAGGCGCTCCAAGAGCCGGTCCCATAGAGATTCCAAAACGCCTCACGAGAGGAAGTCACGTGACCACTCGCCGCTACCAGCAATAATGGAAAAGACTCCACACAAAACTATAGACGTGACAGATGAAAAGAATTCCCCAGACAATATGGACATTTCAAATTCCCCATAA

Protein sequence:

>DPOGS215939-PA
MVSSSTRVIQVTNIAPQATKDQMQTLFGYLGKIDDIRLYPTIRDVSCPVQSRICYVKYYDSATVNVAQHMTNTVFIDRALIVIPMQSGEIPDEHRALEMSSNGTLVPGLSTVEPRLPAHVINTLEGAPPNQVIQTYDPNIAAAGLPPYPPLPAIYDSRKIEEIRRTLLLIDVGELTSQQLIDHFCQAGEVSYVRFCEREVDNLKYALIEMTEQESISKALQLNGVALNGQVIKVHHSTVAISKPQAKSNEAAQREIEEAMCRVKEAQNLISAAIDPVIGLLSKDKRTRSRSRSRRRSRSRSRRSRSRHRSKRSRSRSRHRSRRSRSRHRHRTRSRSRHRSSRRSRSRSRHRSSRSKREKSKERDRKDKKDIGDKEKRDSDKTKSPQKDVGRDGKDELKIDISEVDTNGSSYEHKSKASTPADDKEKTTELDKDKSPRKKERSRSKERKRERSRSKRRSRSRSRRKRSRSRKRSRSRDRKKSRSRERKKSRSRDRKRSRSRDRKRTKSRERKRSRSKDRKRSRSKDRKRSRSPSRRSKSRSHRDSKTPHERKSRDHSPLPAIMEKTPHKTIDVTDEKNSPDNMDISNSP-
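
Consistency check (illustrative sketch, not part of the database record): the record lists a 1767 bp transcript and a 588 aa protein, which agree if the transcript is a complete coding sequence, since 588 codons plus one stop codon gives 589 x 3 = 1767. The Python below shows one way to verify this for any transcript/protein pair. The function names `parse_fasta` and `cds_matches_protein` are hypothetical, and the trailing "-" in the protein sequence is assumed to mark the stop codon.

```python
# Stop codons of the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_fasta(text):
    """Return {header: sequence} for a FASTA-formatted string."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def cds_matches_protein(cds, protein):
    """Check that a coding sequence is length-consistent with a protein.

    Assumes a trailing '-' or '*' on the protein marks the stop codon.
    Does not translate the sequence; it only checks the start codon,
    the stop codon, and the codon count.
    """
    residues = protein.rstrip("-*")
    cds = cds.upper()
    return (
        len(cds) % 3 == 0
        and cds[:3] == "ATG"
        and cds[-3:] in STOP_CODONS
        and len(cds) == 3 * (len(residues) + 1)
    )
```

Applied to the two FASTA records above, `cds_matches_protein(records["DPOGS215939-TA"], records["DPOGS215939-PA"])` would be expected to hold if the listed lengths are accurate; a full check would also translate each codon and compare residues.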