Monarch geneset OGS2.0

DPOGS204832
Transcript: DPOGS204832-TA  2412 bp
Protein: DPOGS204832-PA  803 aa
Genomic position: DPSCF300221 + 349730-359007
RNAseq coverage: 152x (Rank: top 53%)
Annotation
Heliconius: HMEL014396  0.0  86.08%
Bombyx: BGIBMGA001572-TA  0.0  83.04%
Drosophila: CG32533-PA  4e-136  53.21%
EBI UniRef50: UniRef50_G6DDZ8  0.0  99.03%  ATP-dependent RNA helicase n=7 Tax=Endopterygota RepID=G6DDZ8_DANPL
NCBI RefSeq: XP_321806.4  2e-155  67.61%  AGAP001338-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158302196  4e-154  67.61%  AGAP001338-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|157117593  9e-148  67.18%  ATP-dependent RNA helicase [Aedes aegypti]
Group
Gene Ontology: GO:0005524  1e-15  ATP binding
  GO:0004386  1e-15  helicase activity
  GO:0003676  1e-15  nucleic acid binding
  GO:0008026  3.3e-05  ATP-dependent helicase activity
KEGG pathway: cim:CIMG_00743  4e-70
  K12818 (DHX8, PRP22) maps-> Spliceosome
InterPro domain: [47-229]  IPR014001  1.8e-22  DEAD-like helicase
  [287-384]  IPR001650  1e-15  Helicase, C-terminal
  [452-566]  IPR011709  1.9e-14  Domain of unknown function DUF1605
Orthology group: MCL11886  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204832-TA
ATGGATGTCTGTGATGAGAGAAACAACAAAAAGTTAAATAAAGATACTTTTGATATATTTCTTAATATAGTATCTATATATTTAGACTTTAAAAATAAAGAAAAATTTGATAGATTAAAAAATTACGTAAAGCTCAAAAGTGAATTACCAGTCGCTAAATACAGGAATGAAATAGTATCAGCAGTACAAAATGAAAGAGTGGTGATCGTAGCCGGAGACACAGGTTGCGGTAAATCAACACAAGTGCCACAATACTTACATGAAGCGGGATTTCAAAATATAGCTTGCACTCAACCTAGAAGAATCGCATGTATTTCACTGTCGAAGAGAGTGTCATATGAAATGCTAACCCAGTTTGATACTAAAGTTGGCTATCAGATCAGATTCGAGAAAAGCAAGACATCAGACACCAAAATATGCTTCATCACTGAAGGTTTGCTTCTGAGGCAGATGTCTTCAGATAATCTGCCCGAGTATGATGTTATTATTCTAGATGAGATACATGAACGTCACCTCATGGGTGATTTCCTACTGGGTGTACTTAAATGCCTCATCCACACAAGAACTGATATTAAACTCGTTCTCATGTCAGCGACTATTAATATAAAACTGTTCCAGGATTACTTTTCAGCCGAATCAGCTGTAGTTATACAGGTCCCCGGCAGACTCTTTTCGATAGAATTAAATTACAAACCTATACTCATAGAAGAAAAACCTTCTAGGCACGATAAATTAGATCCTCAGCCATACGTACAGATTATGCAGTTGATAGATAGCAAATATCCAAAGGAGGAAAGGGGTGACCTGTTGATATTTATGTCCGGTGTACAAGAAATAACGACAATATGCGACGCGGCGCAGCAATACGCGGAGAAAACAAAGAGTTGGATCGTACTTCCCTTGCACAGTGCGTTGTCACTTATTGAACAAGATAAGGTGTTCGACTATCCGCCAGATGGCGTTAGGAAATGTATAGTCTCAACGAACATAGCCGAAACATCTGTCACCATAGATGGCATAAGATTTGTCATTGACTCCGGGAAGGTTAAAGAGATGAGTTATGATTCGTCAACAAAAATGCAAAGGTTGAAAGAATTCTGGATTTCAAAGGCTAGCGCTGATCAGAGGAAGGGGAGAGCGGGCAGGACTGTTTCAACATATAAATTCAAATTGATGCTACATGGACAGTACAAACAGAAGGCTGCCGAGGATGCGAAGCGTCGTAAGCGTCTCAAGGTCGACACGTGGGAGATAGGTGATGAGGACGATGACGTCATCGATGTGAGAGACATAGAGTTCAGGATGACCAATGACGCTGCCAGGATACGAGCACTGATCAGCGGAGCAAGTACTAGCGGCGGACAGGATCTTGTTATGTTGAAGATCGTGTTATGCAGAGCTTTGTATCCGCAAATAGCTATCGCTGACGAATTCAATTACTGCAAGACAGAGCAGCTATATCACACCTGGAGCAAACCCTCCGTGTACCTTCACCCGACTTCATACTTCGGGAGATACCCTAAAGCACTACAGCTGACCGAGACGGACATACAGACGGCGCCGGGGTATAAGAGCAAGCTGCCGCTGTCAAATAAACACCAACTGTTGTGTTACTTGTCTCTGCTTGAAACCACGAAGCCCTACATAGTTAACTCTATGCGTATGCCGGCAGCGCAGACGTTGCTGCTTCTAGCACATTCCATAGACACAAACACAGGATTCACAAGGATAGTTTGTGACTCCTGGCTCCTCCTGGAATTCCCTTTCCCTGAATCGGGATGCCAATTGCTATATAGAGCATCCACGATAAGAAAGAAATGGGACGAACTGATTAATAGAAAACTTGCAGATGCAAACCCCAACAGGTCGGTGGAGGAGGAGCTCCAGAAGTCAAATCAAATGGGTTACGAGGAACTACAGCATGAGCTATCGTGTGAGATAAGTAAATATATGAACTGTGATGTGTCTTATACCCTCAAGAGGTTACTGCCGGGGGACTT
GAAGGTACTGTACGATGGTGACACCCAGACGACTGTATCTCCTAACCCATTCGATCAAACCTATGTCTGCCGACCCCATGATAAGAAGGGCGGGGTTTATGTCACTGATAATATTGTATACAATTGTATTGTAGATTCAGAGTGGAGCTATGATAGTTATCAGGAAACCTACAGCATACCGTGGGTTTGTCCGCAGTGCGAGGTGACTGTCTGTCTGTCACCTTTAGAAAGATTACAGCATAGGATATTCACTTGCTCATCAAAAACTGAGAAGAAACTAGAGAAGACAGTTACACGCATTAACAGACCCAATACTAAAGAATTTATTTGTGATGTTTGCAATACAACGATGTCCCTAACGCCCGTTGAAATATTGAAACATAAGAAGGCTTGTAAAATAAAGGAGCAATGA

Protein sequence:

>DPOGS204832-PA
MDVCDERNNKKLNKDTFDIFLNIVSIYLDFKNKEKFDRLKNYVKLKSELPVAKYRNEIVSAVQNERVVIVAGDTGCGKSTQVPQYLHEAGFQNIACTQPRRIACISLSKRVSYEMLTQFDTKVGYQIRFEKSKTSDTKICFITEGLLLRQMSSDNLPEYDVIILDEIHERHLMGDFLLGVLKCLIHTRTDIKLVLMSATINIKLFQDYFSAESAVVIQVPGRLFSIELNYKPILIEEKPSRHDKLDPQPYVQIMQLIDSKYPKEERGDLLIFMSGVQEITTICDAAQQYAEKTKSWIVLPLHSALSLIEQDKVFDYPPDGVRKCIVSTNIAETSVTIDGIRFVIDSGKVKEMSYDSSTKMQRLKEFWISKASADQRKGRAGRTVSTYKFKLMLHGQYKQKAAEDAKRRKRLKVDTWEIGDEDDDVIDVRDIEFRMTNDAARIRALISGASTSGGQDLVMLKIVLCRALYPQIAIADEFNYCKTEQLYHTWSKPSVYLHPTSYFGRYPKALQLTETDIQTAPGYKSKLPLSNKHQLLCYLSLLETTKPYIVNSMRMPAAQTLLLLAHSIDTNTGFTRIVCDSWLLLEFPFPESGCQLLYRASTIRKKWDELINRKLADANPNRSVEEELQKSNQMGYEELQHELSCEISKYMNCDVSYTLKRLLPGDLKVLYDGDTQTTVSPNPFDQTYVCRPHDKKGGVYVTDNIVYNCIVDSEWSYDSYQETYSIPWVCPQCEVTVCLSPLERLQHRIFTCSSKTEKKLEKTVTRINRPNTKEFICDVCNTTMSLTPVEILKHKKACKIKEQ-
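
The lengths reported for this record can be cross-checked with simple arithmetic. The sketch below is only an illustration. The helper names `expected_cds_length` and `span_length` are hypothetical, and it assumes that the transcript is a pure coding sequence ending in one stop codon and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated in the record itself.

```python
def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotide length of a coding sequence for a protein of `protein_aa` residues.

    Assumption: three nucleotides per residue, plus one stop codon if present.
    """
    return 3 * (protein_aa + (1 if has_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 2412
protein_aa = 803
genomic_start, genomic_end = 349730, 359007

# 803 residues plus a stop codon account for the full 2412 bp transcript.
cds_matches = expected_cds_length(protein_aa) == transcript_bp

# The gene's genomic span under the inclusive-coordinate assumption.
genomic_span = span_length(genomic_start, genomic_end)

print(cds_matches, genomic_span)
```

Under these assumptions the transcript length matches the protein length exactly, and the genomic span is longer than the transcript, as would be expected if the gene contains introns.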