Monarch geneset OGS2.0

DPOGS207282
TranscriptDPOGS207282-TA3144 bp
ProteinDPOGS207282-PA1047 aa
Genomic positionDPSCF300008 + 53076-60776
RNAseq coverage6303x (Rank: top 2%)
Annotation
HeliconiusHMEL0157630.077.12% 
BombyxBGIBMGA011746-TA0.089.76% 
DrosophilaRm62-PA0.071.93% 
EBI UniRef50UniRef50_D7EJG70.075.26%Rm62 n=1 Tax=Tribolium castaneum RepID=D7EJG7_TRICA
NCBI RefSeqNP_001037582.10.089.76%DEAD box polypeptide 5 isoform 1 [Bombyx mori]
NCBI nr blastpgi|1839793150.089.02%DEAD box polypeptide 5 [Papilio xuthus]
NCBI nr blastxgi|1839793150.089.96%DEAD box polypeptide 5 [Papilio xuthus]
Group
Gene OntologyGO:00055242.8e-50ATP binding
GO:00080262.8e-50ATP-dependent helicase activity
GO:00036762.8e-50nucleic acid binding
GO:00043861e-33helicase activity
KEGG pathwaytca:1001417690.0 
 K12823 (DDX5, DBP2)maps-> Spliceosome
InterPro domain[170-373] IPR0140011e-65DEAD-like helicase
[175-346] IPR0115452.8e-50DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[878-959] IPR0016501e-33Helicase, C-terminal
Orthology groupMCL10060 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207282-TA
ATGGCAATATACATTATACATTCATTCAATAAAAGGCTCATTAATTCTGTAGCCGCCATTTTAATCTCTCTTGCAAAGGCAGTGGTCAAGGTGAGACGTGTGTTGAATTTAGCTTATGGTGATAAGTCAAACAACCATTGGAACAATAGCCGTGGTGAAAACGGTGGTTCCAAATTCGGAGGAGGGGGCAAGTTCGGAGGTAATGGAACATCACGATTTAATCATGGAGGCTCTAAATTTGGTGGTGGTGGTGGGGGCGGAGGAAAGAAAGAGTTTTCCGGAGGCCAAAGTATGAGACGACCAAACTGGGACACAATGTCCTTACAACCGTTTAACAAAGATTTCTACAACCCGCCACCATCAGTCCTAAACAGATCACCATATGAGGTTGAGGAGTACAGAAACAAGCATGAGGTTTCCGTCAGCGGAGCCGATGTTCCTAACCCGATACAGCATTTTGAAGAAGGAAACTTCCCAGACTATGTTATGAAAAGCATTTCAAGCATGGGTTACAATGAGCCCACGCCTATCCAGGCTCAAGGGTGGCCGATTGCCATGTCTGGAAAGAATTTAGTTGGTATTGCACAGACTGGATCTGGAAAAACACTGGCGTACATTCTGCCTGCCATTGTTCATATCAATAACCAACAGCCCGTAAGAAGGGGTGACGGTCCAGTTGCACTTGTTTTGGCGCCAACTAGGGAATTAGCACAGCAGATACAACAGGTTGCCACAGATTTTGGTAATGCCGCATATGTGCGTAACACGTGCGTCTTTGGTGGTGCACCTAAGAGGGAACAAGCCCGTGATTTGGAGAGAGGGGTTGAAATTGTGATTGCTACCCCTGGAAGGTTAATTGATTTCTTAGAAAAGGGTACAACAAATCTTCAAAGATGTACTTATCTAGTGCTTGATGAAGCTGATCGCATGTTGGACATGGGTTTTGAGCCTCAAATACGAAAGATCATAGAGCAGATTCGCCCTGATAGACAAACTCTCATGTGGTCGGCCACATGGCCTAAAGAGGTTAGAAAATTAGCTGAAGACTATCTTGGTGACTATGTCCAGATTAACATTGGATCAATGCAGTTATCTGCAAATCACAACATTCTTCAAATTGTTGATGTGTGTCAAGAACATGAGAAAGAAAATAAGTTAAACACTTTACTACAAGAAATAGGTCAAAGTCAAGATCCAGGTTCAAAGACGATCATATTTGTGGAGACAAAGAGAAAAGTTGAAAATATCACTAGAAATATTAGACGTTATGGCTGGCCAGCGGTTTGCATGCACGGTGACAAGACACAGCAGGAAAGAGATGATGTCTTATATCAATTTAAACAAGGAAGAGCCAACATACTTGTTGCAACAGATGTAGCTGCCCGAGGACTTGATGTTGATGGTATCAAATATGTAATTAACTTTGATTACCCAAATTCATCTGAGGACTACATTCACCGGATTGGAAGAACGGGGAGATCCAAATCAAAAGGCACGTCATACGCCTTCTTCACACCTTCAAATTCCCGTCAAGCTAAAGACCTTGTATCAGTGCTTCAGGAAGCCAACCAGGTTGTTAGTCCTCAATTGCAAACCATGGCTGACCGTTGTGGAGGTGGTGGTGGAGGAGGATGGAACAGGAATAGGTGTTGCAGAGATATAACTAATATCTGCTTTAATAAATCCTGTTTTGCTGCAAAAAACAGTCGAATACTTACCTACACAACTTTATCAAGTTACCAAAAACATCAGTTTATAAGATACTCTTCGGTACCTGCTGCTCAAAGTGATGCTGATTATTGCCGTGAAAATAAAATAACAATAATAGGCGATGACATTCCCAGTCCAGTAAGAGACTTGGACAGTGGAAACTTTCCAGATTATATTAAGAATTTTCTTCAAGAGCAGGGCTTTACCAAGCCTACACTGATCCAGTCCCAAGGATGGCCAATTGCTATGGCGGGGAAAAATTTTGTTGGTATCGCTCAAACAGGTACAGGTAAAACTCTTGCATATTTGCTGCCAGCAGTTATTCAACTAAAAGAAAATAAAGGACGAAGGGGTAAGGGTCCAAGAGCATTAGTACTGGCTCCTACAAGAGAACTTGCAAGACAGATAGAGGAAGTTGCTAAAGATTTTGAAAGGCTTTTGAACATCCGTTGTCTATGTATATATGGAGGTGTGAGCAGATCTAATCAAGCTCAACAGTTGCAACGGGGTGTAGATATCCTAATTGCTACACCTGGCAGATTAAATGATTTTCTTAACAGCAGAGTGACGACTTTAAGCAGATGTACATATGTGGTGTTGGATGAAGCTGATAGAATGTTAGACATGGGCTTTGAACCTCAGATCAGGCAAGCATTGGAAGATGTACCATATGAAAGACAAATCCTTATGTTCTCAGCAACATGGCCTAAGGAAGTACAACATTTAGCTAAAGATTATCTAGGAGAATTTGTACAGGTTAATGTGGGTTCTACAGAATTGACAGCCAATCATAATATCAAACAATGCATATATGTTTGTGAACAAGACCAAAAAATGGATAAATTCAAATCTATCATGCATGAAATATCAGGCAATGGTTTTGGTAAGGTTCTGGTGTTCACAAATACAAAGAAATTTGTAGATAGCTTGACACTGGCACTACAAAGAAATGGCTGGCCGGCGGTCGGCATACATGGTGACAAAACGCAACTTCAAAGGGATATTATTATTAATAAATTTAGAAGCGGAAAAACTAATATTCTTGTTGCCACAGATGTTGCTGCAAGAGGCTTAGATGTTGATGGTGTAACACATGTTGTGAACTATGATTTTCCAAACACATCAGAAGATTATATCCACAGAATTGGAAGAACTGGCAGATCAGATAATAAGGGTGTAGCTCATACTATCTTAACAAGCGAGAATGCTCGACAAGCAAGAAGTCTTATACAAGTACTCAAGGAAGCAAAACAGGAAGTTCCACATGAATTGGAACAGCTGTGCCGTGATTACGGCAGCATGAAGTTCAAGGAGCAGCAGACGAAATATAAACCTAAAAACAATTATAAATGGAAGAATTATAATAACAGGAATTATAACAGAGACGGTTGGAACTCACGCGACAAGTTTGGCAGTATGCGTGAAATGTATTAG

Protein sequence:

>DPOGS207282-PA
MAIYIIHSFNKRLINSVAAILISLAKAVVKVRRVLNLAYGDKSNNHWNNSRGENGGSKFGGGGKFGGNGTSRFNHGGSKFGGGGGGGGKKEFSGGQSMRRPNWDTMSLQPFNKDFYNPPPSVLNRSPYEVEEYRNKHEVSVSGADVPNPIQHFEEGNFPDYVMKSISSMGYNEPTPIQAQGWPIAMSGKNLVGIAQTGSGKTLAYILPAIVHINNQQPVRRGDGPVALVLAPTRELAQQIQQVATDFGNAAYVRNTCVFGGAPKREQARDLERGVEIVIATPGRLIDFLEKGTTNLQRCTYLVLDEADRMLDMGFEPQIRKIIEQIRPDRQTLMWSATWPKEVRKLAEDYLGDYVQINIGSMQLSANHNILQIVDVCQEHEKENKLNTLLQEIGQSQDPGSKTIIFVETKRKVENITRNIRRYGWPAVCMHGDKTQQERDDVLYQFKQGRANILVATDVAARGLDVDGIKYVINFDYPNSSEDYIHRIGRTGRSKSKGTSYAFFTPSNSRQAKDLVSVLQEANQVVSPQLQTMADRCGGGGGGGWNRNRCCRDITNICFNKSCFAAKNSRILTYTTLSSYQKHQFIRYSSVPAAQSDADYCRENKITIIGDDIPSPVRDLDSGNFPDYIKNFLQEQGFTKPTLIQSQGWPIAMAGKNFVGIAQTGTGKTLAYLLPAVIQLKENKGRRGKGPRALVLAPTRELARQIEEVAKDFERLLNIRCLCIYGGVSRSNQAQQLQRGVDILIATPGRLNDFLNSRVTTLSRCTYVVLDEADRMLDMGFEPQIRQALEDVPYERQILMFSATWPKEVQHLAKDYLGEFVQVNVGSTELTANHNIKQCIYVCEQDQKMDKFKSIMHEISGNGFGKVLVFTNTKKFVDSLTLALQRNGWPAVGIHGDKTQLQRDIIINKFRSGKTNILVATDVAARGLDVDGVTHVVNYDFPNTSEDYIHRIGRTGRSDNKGVAHTILTSENARQARSLIQVLKEAKQEVPHELEQLCRDYGSMKFKEQQTKYKPKNNYKWKNYNNRNYNRDGWNSRDKFGSMREMY-