Monarch geneset OGS2.0

DPOGS201595
TranscriptDPOGS201595-TA3084 bp
ProteinDPOGS201595-PA1027 aa
Genomic positionDPSCF300152 + 16936-31258
RNAseq coverage1205x (Rank: top 10%)
Annotation
HeliconiusHMEL0101650.057.76% 
BombyxBGIBMGA010670-TA6e-0722.17% 
DrosophilaCG42724-PC1e-9551.55% 
EBI UniRef50UniRef50_E0VMQ92e-12640.75%Transcription elongation regulator, putative n=4 Tax=Coelomata RepID=E0VMQ9_PEDHC
NCBI RefSeqXP_002427403.13e-12740.75%transcription elongation regulator, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700024394e-13544.23%hypothetical protein TcasGA2_TC004501 [Tribolium castaneum]
NCBI nr blastxgi|2700024390.042.86%hypothetical protein TcasGA2_TC004501 [Tribolium castaneum]
Group
Gene OntologyGO:00055155e-10protein binding
KEGG pathwayphu:Phum_PHUM3176708e-127 
 K12824 (TCERG1, CA150)maps-> Spliceosome
InterPro domain[647-712] IPR0027139.1e-17FF domain
[401-427] IPR0012025e-10WW/Rsp5/WWP
Orthology groupMCL12385 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201595-TA
ATGGAAGACGAATACAGTATAGACCCTCAGGACTCTACTATGAGTGAATTAAACCGTAGCGATGTTAGTGAAATTACGGATTCTTCTAAGGATTTTTGTGAGCCGGGGGGTTATGATGAAAATTACGAAGATGAGGAAGGCGCGAACGGCTTCAGAGGCGGTAGAGGCGGTCGCGGGACGTTCAGGGGTCGCGGGCGAGGTCCGCCGCCTGGTTGGATGAATGGGCCTCCGCCCATGCGCTTCAGGGGCCGCGGCTACGGTCCCGGCGGGCCTCCTATGAGAGGTCGCGGATTCTTCCGTGGGCGCGGTGGTCCGAGAGGATATGGTCCCAATAATGGTCCTAATTACGAAAATAACTGGGGACCCATGGGTCCCCCGCCGCCTGGTATGATGGGAGGTCCTCCACCTTATGGACCACCTCCTGGTATGATGCCGCCAGGTGGACCGATGGGCCCCCCGCCCAACATGATGGGGCAGCCGCCGCCCTTCGGACCTCCAGGGATGCCACCACCAAATATGCCCGCTCCGGAGCTGTGGGTTGAGACCAAGTCAGATGAGGGCAAGTCGTATTACTACCACGCCAGGACTAGAGAGACGACCTGGACCAGACCCCAGGAGAGTCCCACGTGCAAGGTCATCACGCAGGCTGAGGTGGATGTCATGACAGCTGCTGGCCAGTATCCCGGCATGAATCAGTCGATGCCCATGAACGGTCCCATGGGCGGGATGGGCGGTCCCATGGGCGCGCCAATGAACGGCCCCATGGGCATGATGGGCATGATGCCGCCAGGAGTCGGCCACGGGCCAACACCGGGGTCCGTCCCGCCCTTCATGAACCAACCGCCACCCTGGGTCAAGGATAATAACCAGATGCAGTCGAAGCTGGACAAACAGGACAGCTCACCTGATGATGAAGCGCCCCCGGGCGAGGCACCCTCACAGGCTACACAACCACCCGGCACGGGCCCCCTAGGCCCAGCGCCGGGTGCCGGCGGGCCGTGGGGTTGGGGCTGGGCCCCGCCGCTGGTGGCGCAGCCCCCTGGCCTGGCGGCCGCCTCAGCAGCCGTACCTGACACTGGCGCAGCAACTCAGACACAGCCCATCGCGGTCATGGGAAATGACGCACAACCGGACAGCACAGTGACGCCCAAGAAAGAGGAAACGGTGATACCTCCCGAACTGTCTCTACGTGCTGGGGAGTGGACCACACACAGGGCTCCGGACGGCAGGCCATACTACTATCACGCTGGCACCAGGCAGAGTGTGTGGGAGAAACCGCAGCCCCTCAAGGAGTTTGAAGAACTACAAAACAAAATAGCCAAAGAGAAAGGCGAAAAGCTGGACGTCAAGAAGGACAGCAGGGTTATTGACGACGGCAAAATAGAAGTTATAGATGTAGAAGCTCACGCTGAGGCCGCGGCAGCTGCAGAGGCTGCTGAGAGGGAGAGATTAGAGAGAGAGCGGCTGGAGAGAGAGAGAATAGAGAAGGAGAGGTTGGAGAAAGAAAGGTTAGAGAAGGAGAAACAGGAGAAGGAGAAGGCTAAGACGGATAAGAGTAGACCGGTTTCAAGCACTCCCATATCTGGAACACCTTGGTGCGTTGTATGGACGGGTGACGGCAGAGTGTTCTTCTACAATCCGACGGCGCGTCTGTCAGTGTGGGAGCGCCCGGCACAGCTGGCGGGGAGAGCGGACGTGGATCAAGCGGTGTCTCACCCGCCCCACCAGAGGGATCAGCAGCGGAAGGAACCGCCAGCGACCACGACCGTCACGCCGGCCAAGAACGCTAACGGGGAACTGAAGAGGGGAGCGTCCGACTCATCCGACTCGGAGACTGAACCGGCCAAGAAGGCGAAATCCGAAGAAACCAAGAAGAAGTCTGGCGTGTCAGCCGGCGTGATAGATATGGGCAAGGAGGCGGCCAGGGAGGCGATGGCCAGGGCGGAACGCGAGAGGGCGCTGGTGCCGTTCGAACAGCGCGTGAGGGCCTTCCTTCAGATGCTGCACGAGAGCGACGTGTCAGCGTTCTCGCCGTGGGAGAAGGAACTGCATAAGATTGTGTTCGACAGCAGATATCTGCTGCTAGAGTCGAAGGAGAGGAAACAGGTGTTCGATAAGTACGTGAGGGAGCGAGCTGAGGAGGAGCGTAAAGAGAAGAAGAACAGGATCCAGCAGAAGAAGCAGGCCTTCAGGGCGCTCATGGACGAAGCCAAGCTGCACTCCAAATCTTCCTTCACCGAGTTCTCCGGCAAGTACAGCAGGGATGAGAGGTTCAAAAATATTGAGAAGATGAGGGATAGGGAGACTTACTTCAACGAGTACATCGCTGAGGTCCGGAAGAAGGAGAAGGATGACAAGGACAGGAAGAGGGAACAGGCCAAAACGGAGTACTTAGCGCTTTTGAAAGAAAAGAGTGTTGACAGGCACTCTAGATGGTTGGACGTTAAGAAGAAGATAGACTCGGACGCTAGGTACAAGGCCGTGGAGAGTAGCTCGCTGAGGGAGGACTACTTCAGGGAGTACTGCAAGATGGTTAAGGAGGAGAAGAAGAAGGAGAAGGACGGCAAGGAGAAGGAACGTGAGAGGGGCAATAAGAAGGACAAGAAGGACAAAGAGAGGGAGAAAGAGAAGGACCGCGAGAAGGAGACGAAGAAGGAGAAGAAGAAAGAGAAGCCTGCTGACAAACAGTTGGACCAGTCCACTGAGGATGAAAAGAAGCAGCCCACCCCCCCACCGGACCAGTGGGCCGAGATCCTGGGGATACCGGGGGAAGAGAAGGAGAAAGAGAAGGAAAAGGAGAGGGAGAGGGAGAAGAACGCCAAAGAAGCTAAGACGAAGGACAAGAAGGAGTCTGAGAATAGCGAGATGGAACCACCGTCGTCCGAAAAGGAGATGCTGTCACCGAAACAGCTGAGATCCTCGAAGAAAGATCCGCAGAATCAGGAGAAGAAATCCCCAATAAAGTCACCGCCCAAGAAGAGAAAGCAGGAGTTCAAGTCGCCGGAGCCGGAGGCGGAGAAGAAGGGCAAGAAGAAGACTGAGAAGACGGAGAAGAGGCGGCGGAAGAAGTCCGAGAAAGAATAG

Protein sequence:

>DPOGS201595-PA
MEDEYSIDPQDSTMSELNRSDVSEITDSSKDFCEPGGYDENYEDEEGANGFRGGRGGRGTFRGRGRGPPPGWMNGPPPMRFRGRGYGPGGPPMRGRGFFRGRGGPRGYGPNNGPNYENNWGPMGPPPPGMMGGPPPYGPPPGMMPPGGPMGPPPNMMGQPPPFGPPGMPPPNMPAPELWVETKSDEGKSYYYHARTRETTWTRPQESPTCKVITQAEVDVMTAAGQYPGMNQSMPMNGPMGGMGGPMGAPMNGPMGMMGMMPPGVGHGPTPGSVPPFMNQPPPWVKDNNQMQSKLDKQDSSPDDEAPPGEAPSQATQPPGTGPLGPAPGAGGPWGWGWAPPLVAQPPGLAAASAAVPDTGAATQTQPIAVMGNDAQPDSTVTPKKEETVIPPELSLRAGEWTTHRAPDGRPYYYHAGTRQSVWEKPQPLKEFEELQNKIAKEKGEKLDVKKDSRVIDDGKIEVIDVEAHAEAAAAAEAAERERLERERLERERIEKERLEKERLEKEKQEKEKAKTDKSRPVSSTPISGTPWCVVWTGDGRVFFYNPTARLSVWERPAQLAGRADVDQAVSHPPHQRDQQRKEPPATTTVTPAKNANGELKRGASDSSDSETEPAKKAKSEETKKKSGVSAGVIDMGKEAAREAMARAERERALVPFEQRVRAFLQMLHESDVSAFSPWEKELHKIVFDSRYLLLESKERKQVFDKYVRERAEEERKEKKNRIQQKKQAFRALMDEAKLHSKSSFTEFSGKYSRDERFKNIEKMRDRETYFNEYIAEVRKKEKDDKDRKREQAKTEYLALLKEKSVDRHSRWLDVKKKIDSDARYKAVESSSLREDYFREYCKMVKEEKKKEKDGKEKERERGNKKDKKDKEREKEKDREKETKKEKKKEKPADKQLDQSTEDEKKQPTPPPDQWAEILGIPGEEKEKEKEKEREREKNAKEAKTKDKKESENSEMEPPSSEKEMLSPKQLRSSKKDPQNQEKKSPIKSPPKKRKQEFKSPEPEAEKKGKKKTEKTEKRRRKKSEKE-