Monarch geneset OGS2.0

DPOGS214990
TranscriptDPOGS214990-TA2076 bp
ProteinDPOGS214990-PA691 aa
Genomic positionDPSCF300256 - 165825-171743
RNAseq coverage314x (Rank: top 36%)
Annotation
HeliconiusHMEL0101760.080.77% 
BombyxBGIBMGA012163-TA0.075.25% 
DrosophilaCG7757-PA8e-17555.91% 
EBI UniRef50UniRef50_E3X2H81e-17949.26%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3X2H8_ANODA
NCBI RefSeqXP_001661880.10.054.71%Trisn small nuclear ribonucleoprotein, putative [Aedes aegypti]
NCBI nr blastpgi|1571303040.054.71%Trisn small nuclear ribonucleoprotein, putative [Aedes aegypti]
NCBI nr blastxgi|1571303040.054.86%Trisn small nuclear ribonucleoprotein, putative [Aedes aegypti]
Group
KEGG pathwayaag:AaeL_AAEL0117450.0 
 K12843 (PRPF3, PRP3)maps-> Spliceosome
InterPro domain[300-536] IPR0138811.5e-60Pre-mRNA-splicing factor 3
[558-678] IPR0105411.4e-33Domain of unknown function DUF1115
Orthology groupMCL14106 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214990-TA
ATGGCGTTACAGTTGTCCAAACGTGAAGTAGAAGACCTAAGATCCTCCCTCGATCGTGCAATCTATAGAACTATAGGAAAATCAGATAGTTCACTACTATACACGGTGTCTTCGTGCCTGACAAACGGGTATGAGCGCCGTAAGATTATTGATAAAATATCATCACACATTGATTCGAAGAAGGCCAGCAAACTTGCTGACAAGATCATAGCCCTCGCGCAGGAGCTGATCTCATCATCCAAGAGTCAGAAACGGAAATATGAAGATAAAGAAAAAGACAAAGACAGTAAAAGATCTCGTCACGAATCCCGCGAGGAGAGACGGGAGAAGGATGACAGAGAGAGAAAGAGTGACAACGGAGAGGAGCTACCGACCATCTCGGACGGGGACACCATTGGCTCAAAGATGACTGGACTCAGTGCTGATAAGATTAAGGTCATGATGGCGAATGCTCAGAAGGAAATTGAGGAGAGGAAGCGAGCTCTGATGGCTATCAAGGGGGAGTCTCGGAACGTGAGCACGGCCGCGGCCGCCGCGGTCGTCGAGTCCCGGGTGCACCGCGGGGGCATAGCACCCCCTAGCGTCATCAAGCCGATATTGTACTCCAAACCAGGTCGGGTCACGCCGACAACCGCCGAGGAGTTGGAGAAGCAGAGGAAGATAGCGGAGCTGCAGGCCAGGATACAGAGGAAGCTGGCGGGTGGCGCGCTGGCTGCGACCGGGGGCTCCGGGCCCGCGCCCCTCATACTGGACAGGGAGGGTCGCACCGTGGACACCAGCGGCAAGAGGGTGCAGCTCACACACGTGGCGCCCACCTTGAAGGCCAACATAAGGGCGAAGCGGCGCGAGGAGTTCCGCGCCCAGCTGAGCGGGCAGACCACGGAGGCGGTCAACGAAGCCCCCTGGCAGGACGAGCGGCTGGCCAGCAAGCCCCCGGCGAGAACGCGCAGGGCGCTCCGCTTCCACGAGCCCGGGAAGTTCACACAGTTGGCGGAGAGACTCCGTATGAAGGCCCAGCTGGAGAAGCTGCAGACTGAGATATCTCAGATAGCTCGGAAGACGGGCATCTCCTCCGCCACCAAACTGGCGTTACTGGCCGCCGACACGCCGGAGGCACAGAGAGTGCCGGACATAGAGTGGTGGGACAGCGTGATCCTGATGACCCCCGAGGAGAGGGAGGCGAGGGCGAAGGCCGGCGACGACGAGAGGTCATTCAGCGAGCGCGTGGAGGCCTGCAACACGGGACACGACGACATCGTGGAGAACCTCAACGAGGACGCCATCACCAACCTGGTGGAACACCCGCAGCAGCTCAGACCGCCCACCGAACCTCTGAAACCGACTTACATGCCGGTGTTCCTCACCAAGAAGGAAAGGAAGAAGCTCCGCAGGCAGAGCAGGAGAGAGGCCTGGAAGGAGGAGCAGGAGAAGGTGCGCCTGGGGCTGGAGGCGCCGCCGGAGCCCAAGCTCAGGATATCCAACCTGATGCGCGCCCTGGGGACGGAGGCTGTGCAGGACCCCACGGCCATTGAGGCCAGGGTCAGGGAGCAGATCGCCAAGAGGCAGAAGACACACCTCGAGGCCAACAAGGCGAGGGCTCTCACCAAGGAACAGCGGAGAGAGAAGGTGGATAGGAAAATACGCGAGGACACGTCTATGGGGGTGCACGTGGCGGTGTACAGGGTGAAGGACCTGTTCGAGAGCGCGTCCGCCAAGTTCAAGGTGGAGGTGAACGCGCGCCAACTGCACATGACGGGCTGCGTGGTGCTGCACCGAGCCTGCTGCGTGCTGGTGCTGGAGGGGGGCCCGCGGCAGCACGAGAAGTACAAGCGCCTGATGCTGCACAGGATAAAGTGGGAAGAGGAGACCGTGAAGAACGCCGACGACAGCGAGGGTCCAAACTCGTGTACGCTGGTCTGGGAGGGGGTCGCCGCGAGGAGGGCCTTCGGGGACATTAAGTTTAAGGTGATGCCGACGGAGAAGCAGGCGAGGGAGTTCTTCGCCAAGCACGGCGTCGAGCATTACTGGGACCTATCGTACAGCGGGGCCGTGCTGGGGCCAGCGGAGGAGCCCTAG

Protein sequence:

>DPOGS214990-PA
MALQLSKREVEDLRSSLDRAIYRTIGKSDSSLLYTVSSCLTNGYERRKIIDKISSHIDSKKASKLADKIIALAQELISSSKSQKRKYEDKEKDKDSKRSRHESREERREKDDRERKSDNGEELPTISDGDTIGSKMTGLSADKIKVMMANAQKEIEERKRALMAIKGESRNVSTAAAAAVVESRVHRGGIAPPSVIKPILYSKPGRVTPTTAEELEKQRKIAELQARIQRKLAGGALAATGGSGPAPLILDREGRTVDTSGKRVQLTHVAPTLKANIRAKRREEFRAQLSGQTTEAVNEAPWQDERLASKPPARTRRALRFHEPGKFTQLAERLRMKAQLEKLQTEISQIARKTGISSATKLALLAADTPEAQRVPDIEWWDSVILMTPEEREARAKAGDDERSFSERVEACNTGHDDIVENLNEDAITNLVEHPQQLRPPTEPLKPTYMPVFLTKKERKKLRRQSRREAWKEEQEKVRLGLEAPPEPKLRISNLMRALGTEAVQDPTAIEARVREQIAKRQKTHLEANKARALTKEQRREKVDRKIREDTSMGVHVAVYRVKDLFESASAKFKVEVNARQLHMTGCVVLHRACCVLVLEGGPRQHEKYKRLMLHRIKWEEETVKNADDSEGPNSCTLVWEGVAARRAFGDIKFKVMPTEKQAREFFAKHGVEHYWDLSYSGAVLGPAEEP-