Monarch geneset OGS2.0

DPOGS207042
TranscriptDPOGS207042-TA2181 bp
ProteinDPOGS207042-PA726 aa
Genomic positionDPSCF300001 + 1798417-1804419
RNAseq coverage556x (Rank: top 23%)
Annotation
HeliconiusHMEL0068590.065.80% 
BombyxBGIBMGA012983-TA0.082.47% 
DrosophilaDdx1-PA0.068.97% 
EBI UniRef50UniRef50_Q9VNV30.068.97%ATP-dependent RNA helicase Ddx1 n=31 Tax=Bilateria RepID=DDX1_DROME
NCBI RefSeqXP_392325.10.072.43%PREDICTED: similar to Dead-box-1 CG9054-PA isoform 1 [Apis mellifera]
NCBI nr blastpgi|480955900.072.43%PREDICTED: ATP-dependent RNA helicase Ddx1 isoform 1 [Apis mellifera]
NCBI nr blastxgi|480955900.072.43%PREDICTED: ATP-dependent RNA helicase Ddx1 isoform 1 [Apis mellifera]
Group
Gene OntologyGO:00055244.9e-59ATP binding
GO:00080264.9e-59ATP-dependent helicase activity
GO:00036764.9e-59nucleic acid binding
GO:00043863.6e-26helicase activity
GO:00055159.2e-23protein binding
KEGG pathway 
InterPro domain[26-413] IPR0115454.9e-59DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[21-440] IPR0140012e-39DEAD-like helicase
[44-247] IPR0089852e-34Concanavalin A-like lectin/glucanase
[127-243] IPR0183552.2e-29SPla/RYanodine receptor subgroup
[513-596] IPR0016503.6e-26Helicase, C-terminal
[127-242] IPR0038779.2e-23SPla/RYanodine receptor SPRY
Orthology groupMCL14157 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207042-TA
ATGACTGCATTTGAGGAATTTGGAATGCTTCCTGAACTAGGGAAGGCGATAGATGAAATGGAATGGACCCTTCCTACCGACGTACAAGCTGAGGCAATTCCTCTTATTCTTGGTGGTGGAGACGTTCTTATGGCAGCCGAGACTGGTAGTGGTAAAACCGGCGCTTTTTGTCTGCCAATACTTCAGATAGTTTGGGAAACATTAAAAGATATTCAAGATGGTAAAACTAGTAGAATAATACAGGCTGTATCAGGAGAGTGTACAATGTCTTTCTTTGATCGTACTGAGGCTCTAGCTGTCACTCCAGATGGATTGCGATGTCAGTCTAGACATTTTAATGCATGGCATGGATGCAGAGCCACTAAAGGTATACATACTAAAGGTGTCTATTATTATGAAGCTACTGTCACTGATGAAGGATTGTGCCGTGTTGGCTGGTCAGCACCAGGTGCTAACCTGGATCTTGGGACAGACCGGCTTGGTTTTGGATTTGGTGGCACTGGGAAAAAATCAAATTGTAAACAATTTGATGACTATGGAGAGGCTTATGGCAAGAATGATGTAATTGGTTGTCTACTTAATCTAAACAATGGTGATATACGTTTTTACAAGAATGGTCAAGATTTGGGGTTGGCATTTAAACTGGATCAGTCTCGTAGATCAGACTGTTATTTCCCAGCCGTGGTGCTGAAGAATGCGGAAATAAGTTTTAATTTTGGCAACACACCCTTTAAGTATCCACCTAACTGTGAATACATTGCTATAGCAAATGCTCCAAGAGAATATGTCAAAAACAACCCAGTTTGTTCTGTGGCATGTGAAGGAGCAAAACAAGTCAATAATGCACCCCAGGCTATTATAATTGAGCCATCACGAGAGTTAGCAGAACAAACATGCAATCAGATTACACTGTTCAAGAAGTATATAGATAATCCAAAAATAAGAGAGCTTTTAGTCGCGGGCGGTATAAATGTGAAGGATCAAATAAGTCAACTCAATGCTGGTATAGACATTGTTGTTGGCACACCCGGCAGGATTGAAGATCTCATTCAAGGAGGCTACCTGGCGCTAACCCACTGTCGGTTCTTTGTGCTGGATGAGGCTGATGGGTTGCTCAAGTCTGGTTGCGGCGACATGATCGAACGTCTCCATAGACAGATACCGAAGGTCACTTCAGACGGGAGACGCCTCCAGATGGTTGTCTGCTCTGCTACATTACACGCTTTCGAAGTTAAGAAAATGGCGGAAAAACTAATGCATTTCCCCACATGGGTGGATCTGAAAGGCGAAGATTCAGTGCCGGAGACAGTTCATCATGTCGTGGTGAATGTTGATCCACAGGAAGACAAATCCTGGGAGACAGCACAGAAAAAAATACTGACCGACGGTGTTCATGCCAGAGACAATATAGATAACATGTCGCCAGAGACCCTGTCGGAGGCTGTTAAGGTACTCAAAGGAACGTACTGTGTACGAGCCATCAGAGCACACAAGATGGACAGAGCCATAATCTTCTGCCGTACGAAGCTGGACTGTGATAACATGGAGCGGTTCTTGAAGTCTTTCGGTGAAGACTTCTCTTGTGTATGTCTTCACAGTGATAGAAAACCCAAGGAGAGGAAAGAAAACCTTGAGATGTTCAAACAGAGCCGCGTCAAATTTCTCATATGTACAGACGTAGCGGCTCGAGGAATCGATATATCCGGATTGCCTTTTATGATAAACATTACTCTACCTGATGAGAAGTCGAACTACGTTCATCGCATTGGTCGTGTTGGTAGAGCTGATCGTATGGGTCTGGCCATAAGTCTGGTCGCCACTGTACCCGAAAAGGTGTGGTATCACGGTGAGTGGTGTTCCTCCCGCGGCCGGAACTGTTGGAACACAAATTTGAAGGACGAACGGCCAAAAGGCTGCTGTATGTGGTACCAGGAATCCCAGTATCTAGCCGACATTGAGGATCATTTGAATGTTACCATACAACAAATTGAGACTGACATGGTTGTTCCGTGCAATGAGTTTGATGGTAAGGTGGTATATGGACAGAAGAGGAAACAGATGAGCTCAGGGTATCAGGATCATGTCAGTCAGATGGCGCCAAAGGTGCGGTCGTTGAAGGAATTAGAATCACAGGCTCAGGTGTGCTATTTAAGGATTCATTATAACGTCGATGTTATGTAA

Protein sequence:

>DPOGS207042-PA
MTAFEEFGMLPELGKAIDEMEWTLPTDVQAEAIPLILGGGDVLMAAETGSGKTGAFCLPILQIVWETLKDIQDGKTSRIIQAVSGECTMSFFDRTEALAVTPDGLRCQSRHFNAWHGCRATKGIHTKGVYYYEATVTDEGLCRVGWSAPGANLDLGTDRLGFGFGGTGKKSNCKQFDDYGEAYGKNDVIGCLLNLNNGDIRFYKNGQDLGLAFKLDQSRRSDCYFPAVVLKNAEISFNFGNTPFKYPPNCEYIAIANAPREYVKNNPVCSVACEGAKQVNNAPQAIIIEPSRELAEQTCNQITLFKKYIDNPKIRELLVAGGINVKDQISQLNAGIDIVVGTPGRIEDLIQGGYLALTHCRFFVLDEADGLLKSGCGDMIERLHRQIPKVTSDGRRLQMVVCSATLHAFEVKKMAEKLMHFPTWVDLKGEDSVPETVHHVVVNVDPQEDKSWETAQKKILTDGVHARDNIDNMSPETLSEAVKVLKGTYCVRAIRAHKMDRAIIFCRTKLDCDNMERFLKSFGEDFSCVCLHSDRKPKERKENLEMFKQSRVKFLICTDVAARGIDISGLPFMINITLPDEKSNYVHRIGRVGRADRMGLAISLVATVPEKVWYHGEWCSSRGRNCWNTNLKDERPKGCCMWYQESQYLADIEDHLNVTIQQIETDMVVPCNEFDGKVVYGQKRKQMSSGYQDHVSQMAPKVRSLKELESQAQVCYLRIHYNVDVM-