Monarch geneset OGS2.0

DPOGS211231
TranscriptDPOGS211231-TA3759 bp
ProteinDPOGS211231-PA1252 aa
Genomic positionDPSCF300385 - 91027-100524
RNAseq coverage219x (Rank: top 45%)
Annotation
HeliconiusHMEL0034450.083.92% 
BombyxBGIBMGA005168-TA0.080.78% 
Drosophilatst-PA0.048.87% 
EBI UniRef50UniRef50_Q7QBG50.049.08%AGAP003182-PA n=10 Tax=Opisthokonta RepID=Q7QBG5_ANOGA
NCBI RefSeqXP_002013687.10.049.52%GL24270 [Drosophila persimilis]
NCBI nr blastpgi|2700103330.048.42%hypothetical protein TcasGA2_TC009717, partial [Tribolium castaneum]
NCBI nr blastxgi|2700103330.048.59%hypothetical protein TcasGA2_TC009717, partial [Tribolium castaneum]
Group
Gene OntologyGO:00168170hydrolase activity, acting on acid anhydrides
GO:00055242.2e-51ATP binding
GO:00168182.2e-51hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
GO:00080266e-21ATP-dependent helicase activity
GO:00036766e-21nucleic acid binding
GO:00043861.7e-18helicase activity
KEGG pathwaydpe:Dper_GL242700.0 
 K12599 (SKI2, SKIV2L)maps-> RNA degradation
InterPro domain[1-1244] IPR0164380RNA helicase, ATP-dependent, SK12/DOB1
[1068-1244] IPR0129612.2e-51DSH, C-terminal
[303-482] IPR0140011.2e-28DEAD-like helicase
[310-456] IPR0115456e-21DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[618-704] IPR0016501.7e-18Helicase, C-terminal
Orthology groupMCL10551 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211231-TA
ATGTCTTTAAAATATGATGAGAAATTATTTTCAGACATAAAGCCACCACCGATTTTTGAAGATTTAAGTGAAAGCATAAAAGATTATCTCTTGAAACCGGAGAAACTATCAATTCATAAATGGGAGAGATCACAAACACATTGGCACAGAAAATCGGATATAGATTCTTTGTTCAAAAGCGATGATGAAGAACTAGGTATTGATACCACACTCGAAGTTGTTCGAGATCCTAGAACAGGGGAAATTATAGGTCTGGAAGAGATAAATATACCAGTTCAAGATGACGAAGATAATCTATCCATGTCTCGGGCTCCATTGCCTCCTAACTTGGCTACTAGAGGCACAACTACTCAAAATCCTTTTTTACCAGCTGGTTTTGAGGAGGAACTGCAAAAGATGTTGGATGAGGCGGCACAGAGTTCCGAAATTGAAATTAATTTTGAAGATGATGAGCCTGGAAAATTTCTAGGAGAAGACATTTTATCAACAGCGCCAGGTTCAAAGGAGGCTGTTTTGTTTGCTGAAGACGGAATGACTTTACTTGATCATCAGAAAGATGTACAAGAGGATAAAACTCAAGAACTGGATCTCAAGATAGATATTGATTTGGAAGAAGTGGTTGATAATAATGCACACTTAGTTGGTCTCTGGAAAGATGACGAAAATGAGAAAAATGAGGTCTCGAAACCGATTAAGAAGATACAAATCGAAAAAGACAAGGAAGAAGATAATTTCTTAGAGAGCACCATCATCAGACCACCCGTAGAACTTCCCGAGATACCAATATTAAACATAACAAACTCTGCCGTGAAGCTCGGAGTCACATCTACCGAGTGGGCCGAAATGATTGATGTATCCCTACCGGTGCCAGATTTTAAAGAAAAAATAAAAGACATGGCACATTCATATCCATTTGAATTGGACAGCTTCCAGAAACAGGCTATACTTAAATTAGAAGAAGGCCATCATGTATTTGTAGCTGCCCATACATCTGCCGGGAAGACAGTTGTAGCCGAGTATGCTATAGCCATGTCAAGAAGAAATTGTACCAGAGCAATCTACACATCACCAATCAAAGCCCTATCAAACCAGAAGTACAATGATTTTAATAAAATGTTTGGTGAAGTTGGTCTCCTGACTGGAGACCTTCAGATCAACGCTACAGCCTCCTGCCTCGTGATGACCACTGAGATACTAAGGTCCATGTTGTACTGCGGCTCTGACGTCACCAGGGACCTAGAATTCGTTATATTCGATGAGGTCCACTACATTAATAATACTGAGCGTGGCTATGTTTGGGAGGAGGTTCTAATTCTTCTTCCTGCCCACGTCAGTATAGTGATGTTGAGCGCGACTGTACCCAACACTCTACAGTTCGCTGACTGGGTGGGTCGTACTAAGAAGAGGAAAGTCTATGTCGTGTCTACGCCTAAGAGACCTGTACCTTTGTGCCATTATTTATATACAGGGTCGGGAGGTAAATCGAAAAATGAAAGATTTCTGGTCGTCGATCAAGAGGGTGCCTTTCAGTTGCGCGGTTACAATGAAGCTGCTGCCGCTAAGAAGGCGAGAGAGAACGAATATAAGAAGAGTTTTGGCCCGAAAGGTGGAAAACAATTCGGGAATCCTAAAGCCGAACAAACCATGTGGGTAGCGTTCATAGATCACCTGAGGAGCTGCGATAAGTTGCCCGTCGTGGCTTTTACCTTGTCGAGAAATCGGTGTGATCAGAATGCTGAAAATTTGATGTCAGTAGATTTAACAACGGCCAAAGAGAAAAGTCACATCAAATCATTCTTCATGAGATGTCTTCAGAGGTTGAAGGAGCCTGATAGAAAGCTTCCACAGGTGATACGTCTCCAAAGAGTATTGGAGAACGGTATCGGGGTACATCACAGCGGTATATTGCCTTTGCTTAAAGAAATCGTCGAAATGCTCTTCCAGTCCGGTCACGTGAAAATTCTTTTTGCGACGGAGACGTTCGCTATGGGCGTCAACATGCCAGCGCGTACTGTAGTCTTCGATGATATCACCAAGTTCGACGGCATACAGTCCAGGAGCCTCGCGCCAGCTGAGTACATACAGATGGCGGGCAGGGCAGGGAGGCGAGGTTTGGACGATACAGGCACAGTGATAATCCTCTGCAAAGAAGGCGTTCCAGATCAAGTGACGCTTAAAGGAATGATGTTGGGAACTCCACAGAAGCTGTCATCGCAGTTCAGGCTGACATACGCCATGATACTCAGTTTATTGCGTGTAGCAACAGTATCAGTGGAGGGTATGATGCAGAGATCCTTTAGAGAATTCCATCAGATCTGCCAAGCCGACAACAACAGGAAACAACTGCAATTAGCTGAAAAGGAATATTCAGAGAAATGTAGCACACCCCTGCCATCGCATTTGGCGCCGTTAGCCACTTTCTATGACATAGCCATACAATATATAGACGTTTTAAATGATATCATGCCAATATTACTGAACCAATCTAAAGTTGTTAAGGAATTCGTGCCAGGCAAGGTTCTCATAATATCCGCCGGACCGTTCATAAATCAATTGGGTGTCTACTTGAACAACAGCGGTCCCAGGCAAACCCCATACAAGGTACTAGTTTTAAACACAGCTGAACAAGATACAGCTAGATACAACTTTGATGTGGACGAAAATTGGTACAGGATGTTGGGCTTCTCTAAACTCTATGAAAACATAGGTACTGAAGAAAGTACAATGGATCATACAATACTGTGTATAGCGCCTAAGAATATTGTGGCTGTTACAAAAACTAATCTTAAAATTGATGCTAATCTCATCATAAGAGACTGGGAGCAAAGACAGATGCCTCGTTTCAAGGATGCTCCAGTAGGTGCCACCTGTGGGCGATCAGTGCAGGAGCTGTGCCAGTTATCTCACGCTTCACGCACTTCAACCGCCGGCCTGGAGACGCTCAGTCTCACACAGGCACTCGCCATCACCACTGGAGAGATACTACAGACACTAGACAAGATGAACAAATACAAATCCGAGCTCGAGGCACAGAAGAAATACACAGATATAGCGAACTTCAAGAGCGAATTCGCTGTTGTGTACGAACGGAAACAAGCTGAGAGGAAACGTGATAAGTACAAACGTCTTCTGTCCTTCGAAAATCTAGCTCTATATCCAGACTACCAGAGACGGTTGATGGTTCTACGAGAACTGAACTATATAGATGATCATGACAGCGTTATCTTAAAGGGTCGTGTTGCGTGCTGTATGGGCACTAACGAGCTTATCATATCAGAACTGGTGTTCCGGAATGTATTCACCGATAAAAATCCAGCGGAAATCGCGGCACTCCTCAGCTGTTTCGTGTTCCAAGCTAAGACTAGAGTGGAACCGGCTTTGACTGAGAAGTTACAGGCTGGTGTTAAGGCTATAGAACAGATTGATGATGAACTTACTAGGATCGAGGCTAAATATATGGTCGGACAATTCGAAGGTCAAGCAGAGAGATTAAACTTTGGTCTAGTGAGAGTTGTCTATGAATGGGCCCTAGAAAAACCGTTTGCAGAAATCATAGACTTGACAGATGTTCAAGAAGGTATTATTGTGAGATGCATCCAGCAACTTCATGAGCTCTTAGTTGATGTGAAAGACGCAGCAGTTGCAATTGGTGATCCAAAACTTCAAGCAAAAATGATGGAGGCTTCCACAGCTATAAAGAGGGACATAGTTTTTGCAGCAAGTTTATATACTACTCAGCGAGAGACAGTGATATTATGA

Protein sequence:

>DPOGS211231-PA
MSLKYDEKLFSDIKPPPIFEDLSESIKDYLLKPEKLSIHKWERSQTHWHRKSDIDSLFKSDDEELGIDTTLEVVRDPRTGEIIGLEEINIPVQDDEDNLSMSRAPLPPNLATRGTTTQNPFLPAGFEEELQKMLDEAAQSSEIEINFEDDEPGKFLGEDILSTAPGSKEAVLFAEDGMTLLDHQKDVQEDKTQELDLKIDIDLEEVVDNNAHLVGLWKDDENEKNEVSKPIKKIQIEKDKEEDNFLESTIIRPPVELPEIPILNITNSAVKLGVTSTEWAEMIDVSLPVPDFKEKIKDMAHSYPFELDSFQKQAILKLEEGHHVFVAAHTSAGKTVVAEYAIAMSRRNCTRAIYTSPIKALSNQKYNDFNKMFGEVGLLTGDLQINATASCLVMTTEILRSMLYCGSDVTRDLEFVIFDEVHYINNTERGYVWEEVLILLPAHVSIVMLSATVPNTLQFADWVGRTKKRKVYVVSTPKRPVPLCHYLYTGSGGKSKNERFLVVDQEGAFQLRGYNEAAAAKKARENEYKKSFGPKGGKQFGNPKAEQTMWVAFIDHLRSCDKLPVVAFTLSRNRCDQNAENLMSVDLTTAKEKSHIKSFFMRCLQRLKEPDRKLPQVIRLQRVLENGIGVHHSGILPLLKEIVEMLFQSGHVKILFATETFAMGVNMPARTVVFDDITKFDGIQSRSLAPAEYIQMAGRAGRRGLDDTGTVIILCKEGVPDQVTLKGMMLGTPQKLSSQFRLTYAMILSLLRVATVSVEGMMQRSFREFHQICQADNNRKQLQLAEKEYSEKCSTPLPSHLAPLATFYDIAIQYIDVLNDIMPILLNQSKVVKEFVPGKVLIISAGPFINQLGVYLNNSGPRQTPYKVLVLNTAEQDTARYNFDVDENWYRMLGFSKLYENIGTEESTMDHTILCIAPKNIVAVTKTNLKIDANLIIRDWEQRQMPRFKDAPVGATCGRSVQELCQLSHASRTSTAGLETLSLTQALAITTGEILQTLDKMNKYKSELEAQKKYTDIANFKSEFAVVYERKQAERKRDKYKRLLSFENLALYPDYQRRLMVLRELNYIDDHDSVILKGRVACCMGTNELIISELVFRNVFTDKNPAEIAALLSCFVFQAKTRVEPALTEKLQAGVKAIEQIDDELTRIEAKYMVGQFEGQAERLNFGLVRVVYEWALEKPFAEIIDLTDVQEGIIVRCIQQLHELLVDVKDAAVAIGDPKLQAKMMEASTAIKRDIVFAASLYTTQRETVIL-