Monarch geneset OGS2.0

DPOGS200847
Transcript: DPOGS200847-TA, 2271 bp
Protein: DPOGS200847-PA, 756 aa
Genomic position: DPSCF300071 - 125497-132147
RNAseq coverage: 835x (Rank: top 15%)
Annotation
Heliconius: HMEL012340, 0.0, 85.06%
Bombyx: BGIBMGA009903-TA, 0.0, 86.31%
Drosophila: CG6418-PB, 0.0, 58.59%
EBI UniRef50: UniRef50_B3M5L4, 0.0, 60.13%, GF23818 n=1 Tax=Drosophila ananassae RepID=B3M5L4_DROAN
NCBI RefSeq: XP_624210.1, 0.0, 61.62%, PREDICTED: similar to CG6418-PB [Apis mellifera]
NCBI nr blastp: gi|322799398, 0.0, 64.83%, hypothetical protein SINV_08125 [Solenopsis invicta]
NCBI nr blastx: gi|383857449, 0.0, 62.91%, PREDICTED: ATP-dependent RNA helicase DDX42-like [Megachile rotundata]
Group
Gene Ontology:
  GO:0005524, 1.1e-42, ATP binding
  GO:0008026, 1.1e-42, ATP-dependent helicase activity
  GO:0003676, 1.1e-42, nucleic acid binding
  GO:0004386, 2e-28, helicase activity
KEGG pathway: ame:551822, 0.0
  K12835 (DDX42, SF3B125) maps-> Spliceosome
InterPro domain:
  [278-478] IPR014001, 2.1e-57, DEAD-like helicase
  [283-452] IPR011545, 1.1e-42, DNA/RNA helicase, DEAD/DEAH box type, N-terminal
  [516-597] IPR001650, 2e-28, Helicase, C-terminal
Orthology group: MCL13907, Single-copy universal gene

Nucleotide sequence:

>DPOGS200847-TA
ATGAGTTACAATTCGAGTGGGGGCAAGGGGTTCGGATTTTCCGGATTCACGATGCCAAAACGGAACCCTTCGAACGTTTCACTACACGCTGTTCCTCCGCCTCCTTCCCAAATGAATGCTGTCCCTCCTCCGAATCCCGGGTTATCGAAGCAGGGTTATTCAACTATGAACGCGATAACACAGAACGCCATCGGTGGTACTTGGGGTACACTTGGGAAAAAGCGAGCTAAGACTGAAGATGAGTATTTTGATGAAGATGATGAGCCTCCGACTCCAGCTCTTGCTTATATACCAGCACCAGGGAGCCCTACTAATGAAGCCAACAGCTCTAGGGCAAATGAAGAAGAGGAAGATCCCCTGGACGCGTACATGGCCGGTCTGGAGAAACAGGCGGCTAAAGATATGAAGGTTAGCAAAGAGAATGCCGTTAGTGGAAAAGGTGACGCCGGGAGAGGGACCAGAGGAGATATTGATGAGATGGACGACGAAGAAAGTTACTATAAATACATGGAAGATAATCCACTCCAGACAGCCGATGATGGCTCCGACGTTGAAATAGAGTATGATGAGGATGGAAATCCTATCGCTCCTCCTAAGAAAAAGTTTATAGATCCGCTGCCACCGATAGACCATTCGGAGATTCAGTATGAACCGTTCGAAAAGAACTTCTATACACCACACGAAGATATAGAGAAACTAGAGCAGCATCAGGTGGAAGAGCTGAAGAAAAACTTGGGGGTCAAGATTTCCGGACCTGATCCTCCGAAACCTGTGAGTAGTTTCGGTCATTTGGGCTTCGATGAACAGCTGATGAAGGCTATTCGAAAGTCAGAGTACACTCAGCCGACGCCGGTGCAGGCGGCTGGCATACCAGCGGCGCTCTCCGGAAGGGATCTCATAGGTATTGCCCGCACTGGTTCTGGTAAAACGGCAGCATTCCTCTGGCCACTGCTCGTCCACATCATGGATCAGAAAGAGTTGGCTCCGGGGGATGGGCCCATCGGACTCATACTGGCCCCCACTTCCCTCAACCGAATATACATGGAGGCGAAGAAATTTGGCAAAGTATACAACATCAGATGTGTTTGTTGTTATGGAGGGGGGTCCAAGTGGGAGCAGAGTAAGGCTTTGGAAGGGGGCGCGGAGATAGTCGTTGGCACTCCGGGGCGGGTCATCGACCTGGTGAAATGCAAGGCGACCAATCTTCAGCGCGTCACGTACCTGGTGCTGGACGAGGCCGACCGGATGTTCGACATGGGGTTTGAGCCTCAGGTCCGTTCCATCTGCAGTCACGTCCGTCCTGAGCGCCAGGCCCTGCTGTTCTCCGCGACCTTCCCTCGTCGCGTGGAGCGCCTCGCCCGTGACGCTCTTCACGACCCCGTGCGAGTCCAACACGGAGCGGCCGGAGAAGCCTCCAAGCTGGTGAAACAACGTGTCACTATCTTCAATAAACCGGAAGAGAAGTGGCCCTGGCTGTTGGAGAATTTAGTCGACTTCCTGTCGTCGGGGAGCGTGTTGATATTTGTTACGAAGAAGTTGGAAGCGGAACAGACAGCAGCAAACCTCGGCGTGCAGCAGTATGACGCGCTGCTGCTGCACGGAGACCTGGAGCAGGCGGACAGGAACAAGGTCATCACGGCCTTCAAGAGACAGGAGAGCAACATACTCGTCGCCACCGACGTAGCTGCTCGCGGTTTGGACATCCCTCACATCCGCACGGTGGTGAACTACACCGTGGCGCGCGACATCGACACACACACACACAGAGTGGGCCGCACGGGGAGGGCCGGCGTCCCGGGGACGGCGCACACGCTGCTGTCCCGGGACAGGGACAAGGACTTCGCGGGACACCTGCTCAGGAACCTTGAGGGAGTGCAGCAGGAGGTGCCGGAGGAGTTGATGCAGCTAGCGATGCAGTCAACGTGGTTCCGGAAATCACGGTTCAAGAAGGGGAAGGGCAAGAATCTGAACATAGGCGGCTGCGGACTCGGTTACAAAGA
GCGTCCCGGGCTGCCCGCCTACAACGACGAGGTGTCTCTCACAGCGAGCGTGGAGAAGACGGTAGAGAAGGCCGGGGGCCCCGCCACCGACCGCCTCGCCTCGCTCAAACAAGCCTTCCGCTCACAATACAACCAGTTCACCGCGTCGTCTGACCACTCGTGGGAGCAGACGCGGCCCGTCCTCCAGCCGGGGGTGAACGCGCCGGCCAACGCGAACACGGACAAAACCGAGAGACTGCGCAAGAGCGGCAAGAAGAGCCGCTGGGAATAG

Protein sequence:

>DPOGS200847-PA
MSYNSSGGKGFGFSGFTMPKRNPSNVSLHAVPPPPSQMNAVPPPNPGLSKQGYSTMNAITQNAIGGTWGTLGKKRAKTEDEYFDEDDEPPTPALAYIPAPGSPTNEANSSRANEEEEDPLDAYMAGLEKQAAKDMKVSKENAVSGKGDAGRGTRGDIDEMDDEESYYKYMEDNPLQTADDGSDVEIEYDEDGNPIAPPKKKFIDPLPPIDHSEIQYEPFEKNFYTPHEDIEKLEQHQVEELKKNLGVKISGPDPPKPVSSFGHLGFDEQLMKAIRKSEYTQPTPVQAAGIPAALSGRDLIGIARTGSGKTAAFLWPLLVHIMDQKELAPGDGPIGLILAPTSLNRIYMEAKKFGKVYNIRCVCCYGGGSKWEQSKALEGGAEIVVGTPGRVIDLVKCKATNLQRVTYLVLDEADRMFDMGFEPQVRSICSHVRPERQALLFSATFPRRVERLARDALHDPVRVQHGAAGEASKLVKQRVTIFNKPEEKWPWLLENLVDFLSSGSVLIFVTKKLEAEQTAANLGVQQYDALLLHGDLEQADRNKVITAFKRQESNILVATDVAARGLDIPHIRTVVNYTVARDIDTHTHRVGRTGRAGVPGTAHTLLSRDRDKDFAGHLLRNLEGVQQEVPEELMQLAMQSTWFRKSRFKKGKGKNLNIGGCGLGYKERPGLPAYNDEVSLTASVEKTVEKAGGPATDRLASLKQAFRSQYNQFTASSDHSWEQTRPVLQPGVNAPANANTDKTERLRKSGKKSRWE-
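
The lengths listed for this record are consistent with each other: a 2271 bp transcript holds 757 codons, which is 756 residues plus one stop codon (the trailing "-" in the protein sequence presumably marks that stop). Below is a minimal Python sketch of that check. It assumes the transcript is a complete coding sequence with no untranslated regions, and the helper names read_fasta and expected_protein_length are hypothetical, not part of any tooling distributed with this geneset.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def expected_protein_length(cds_length):
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Toy input (not the real record): a wrapped 9-nt CDS encoding M, K, stop.
toy = ">toy-TA\nATGAA\nATAG\n"
toy_records = read_fasta(toy)
```

Applied to this record, expected_protein_length(2271) gives 756, matching the listed protein length. Joining wrapped lines in read_fasta matters here because the nucleotide sequence is split across two lines.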