Monarch geneset OGS2.0

DPOGS210817
TranscriptDPOGS210817-TA5742 bp
ProteinDPOGS210817-PA1913 aa
Genomic positionDPSCF300027 - 592616-610407
RNAseq coverage588x (Rank: top 22%)
Annotation
HeliconiusHMEL0085180.057.43% 
BombyxBGIBMGA007133-TA0.058.27% 
DrosophilaHel89B-PB0.046.27% 
EBI UniRef50UniRef50_D6WW100.048.95%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WW10_TRICA
NCBI RefSeqXP_966659.10.048.95%PREDICTED: similar to TATA-binding protein-associated factor 172 [Tribolium castaneum]
NCBI nr blastpgi|910884130.048.95%PREDICTED: similar to TATA-binding protein-associated factor 172 [Tribolium castaneum]
NCBI nr blastxgi|910884130.048.85%PREDICTED: similar to TATA-binding protein-associated factor 172 [Tribolium castaneum]
Group
Gene OntologyGO:00168173.6e-86hydrolase activity, acting on acid anhydrides
GO:00036772.9e-76DNA binding
GO:00055242.9e-76ATP binding
GO:00054886.5e-45binding
GO:00043861.1e-20helicase activity
GO:00036761.1e-20nucleic acid binding
KEGG pathwaykla:KLLA0E23804g0.0 
 K01509 (E3.6.1.3)maps-> Purine metabolism
InterPro domain[627-1115] IPR0227073.6e-86Domain of unknown function DUF3535
[1362-1651] IPR0003302.9e-76SNF2-related
[48-1396] IPR0160246.5e-45Armadillo-type fold
[1355-1546] IPR0140011.2e-28DEAD-like helicase
[1171-1393] IPR0119893.5e-23Armadillo-like helical
[1725-1811] IPR0016501.1e-20Helicase, C-terminal
Orthology groupMCL13703 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210817-TA
ATGCGCTGCTCCTGTCTCGCCAATACATTTAACAACTGTGACGTATTTGATAAAAGAAAAACATTGAAGCTGAATTTAAATTCTGAAAATGACTTCTCGTCATCATCGACATACAGTCAAAGTGACAGGTTCCTTCATCACCCCAGGTTGGATCGTCTGTTCGTGTTGCTGGAGGCGGGCGCGGGCCCGGCCACGAGGCGAGCCGCCGCCAATCAGCTCGGGGAAGTGCAGAAGGCTCATCCCGAGGAACTGCACCGCCTGCTGGCCAGGCTCATGAAACACCTGCGGTCACCGGCCTGGGAGACGAGGGTCGCTGCTACACAGGCGGTAGAGGCGATATTATCACATGTCCCAGAGTGGCTCCCCCCGCCATGCACAGCTAAAGAGGAGGATATTGATCAAGATGACAGCAGCCGTCTGAGATGTGAAACGTTCGACATCGAACGTGTCCTGGAACATGGAGCGCATCTCATGGGATCCGAGGGTCACGAGTATGACCTGGATGAGGAGACGCTTTCAGCTACAGATATGAAGGACCGTCTGACCAAACAGCGCCAGAACCTCAACAGTCGCTTGGGTCTGGACGTGGCTGCGTCTCTGGGGGTGGACCTCAGCGGGGTCTACTCCAACGAGGACCTCTGCATCCAGAAGACAGCTACTACACCCACAAATAGACGTCCCGTCCAGGAGCTGGTGTTGTCGAAGGGTCTCAGCTCGCGTGAAATGAATCTGGCTAAGCGTAAGGCTCGGCTGGCCTTCAGCAAGCAAAAGTCCCGCGATTGTGAGGAGTCTCCCGCCACGACGCCCACCACGCCCACCACCACCATCGCCAGCATCGCCACCATCGCCAGCATCGCCACCACCACCATGGAGCCAGAACGGAAGAAGATCAAACTGGAACAGGCCGACGATACCATCTTGGAGGTGTCTGGTTGTGCGGTGCCGAGCCGTGATGGCAGTTGGGGGGAAGGTCATCGGTGGCCCCTAGGGGCCTGGAGCGGGGCCCTCACCGTTCAGCTACTCAGTGGGGCCTGGGAGGCCAGGCACGGGGCTGCCTCTGCTATAAGAGAGTTACTGAGGGCCCCTATCGTGCGCTCCGCTGGATGGAAGGCTGGAATGACTGCACAGCAGATGGAGGACTCTCATCAGGAGTGGTTGGAGGACATGGCGCTTCGTTTACTGTGTGTGCTGGCGCTCGACAGATTTGGAGACTTCGTTTCTGATCAGGTGGTGGCCCCGGTGCGTGAGTCTAGCGCTCAGGCCCTGGGCGTGTGTCTGTCACGACTGAGGGCTGAGCGAGTGTCGCTAGTGGCGAGGGCGCTGGGAGGACTCGCACGACATGCGCAGTGGGAGGCCAGGCACGGGGCGCTGCTGGCCTTCAAATACCTACTGGCAGCCAGGAGGGAGGTGGCAATCGAATGCGGGCTGGCGGCGAGGCTGTGGGCGCTGCTGCGGGACCAGGACGAGCTCGCGGCGCCCGCCCACAAGTATATGGCGCTGCTGGCTGCGCTCATGGCCCTGCCCGCCGCCGCCGCTCACCTACATCCCATTGATTTGGCGGATGTCCTGCCAAGACTGTGGCCGTACCTCGACCATTCCACCAGTTCGGTGAGGAAAGCGACTCTACAAACACTGAGAACAATCACAAGACCCCTCGTAACTACAACAAGTAACGGACAGAACGGTGGTGGAAATGAAAAGGACGATGCTCAGAGTAATACAGACAATAGTCAAGGGAGCGACACTCGAGTTGAAGAGAACGGAGACTCCAACAATTACCTGCAATGGACCCCGGAACTACTACAGGATGCGATGCGGCATGTGTACCAAAGGGTGTTGTTCGAACACGTTGATGATATACAGGAGATAGCATTACAGGTGTGGGAGAACCTCCTTCATCACGCAGACTTGGGTGTGATCCTGGTGGCTTCATGTCCGCTGCTCTCGCTCTGGATGTGTCTCGCCATGCAACCCGCTAGACTGCCGCTTGACCCTGCACTATTACTCCACACGCCCACAAAGGAACGCTCCTGCCGAGTCCGGATTGGTGGAGTTATTCAACCAAATAATGAAGGTGAACCGAGGCCAAGTCAGAAATGGTATCTCGGTGGAAGCGAGTCACAGTCGGCTGCTATCAGAGATGCCAACGTCACCAAGGCTAGATGTTTGGCAGCGGAGTTGTTGGGTCATCTGTCCTGTTACCTCGTACAACCGGCCCCGGGGATTGAATACAAGGCAGAAGACGAGAGCCCCATTGACTGTTATGTTAGGGTTATGTTAGTTTACCTGAGATCCGGCAGTGCTCTCCAGCGGCTGGTTGCCGGTCTGGTGCTGGCGTCGTGGGCGCGGAACTCCTTGCGATATGGACTTTACCCACCACACCTAAAGCATACGGAAACAAGGGAAGACAATCTCAGTGTTGTATCTAAACTGGCGCCGCCGTCGCTGGTGACCGCCTTGCACACCGCGCTCAACCAGACCCTGTACTATGATGAGGTCGCTCTCAACTGTAACAGGATACTACAGGAGGCGCGCGACCTGCTGGCCATGATGAAACATTACAAACTACCAGTGGACAATGAAGAGTATAATAATATACTGAGACTAGAACAGGTGTCAATGCTGGCGGCGGGATGCGAAGCGGCGGTGGCCGGGGCTCGCAGTAAGAGGGTGGCCGCCTCGCTGGAGGAGCGCCGCGTCGCCCTGCTGGAGGCCGCCCGGGCTGCCACACGGGAGCACGCCGCGCTCGCCGTGTCAGTCCAAGCCGCCCTCAGCAGTGCGGTGGCCCAGCTGGCGGCGTTACCCGAGCGACTGAACCCAGTGGTGAGACCTCTCATGGAGAGTATCAAGAAGGAGGCTTCCGAGGAGCTGCAGAGATTGTCAGCCGGGACACTAGCAGCGTTACTGGCACAGCTCGTCCACAGACAACCCTGTCCTAATAATAAAGTTTTGTCTAATCTTAAAGCTTTTCTGAGTTGCGATCCAGAATTCACTCCGCGCATAAGTTTGGAGACAACAGAGGAAGTGAACGGTGATTCCGGAAGTGGCGACAGTGGGGGGGAGAATGAAGGTCCTGCCAACCAGACACCGAGCTTGGACAAGTACGAGGGCATCCTCACTCTCCGCGAGCAGCAGCGGAGTGCGGAGCGACTGGTGACGAGGCGGGGCCGGCCGCCCGCACACACGCCCGCAGACCTCAGCCTGGACCAGCTGTTCCCGCACGAGGACGAGGCCAGGAAATTGCTTCGGATACAAAGGAGGGGCGCCACGATGGCCTTGACGAGTCTCACGGAATACTTCGGTGATGATCTGCCGGAGAAGCTGCCGAGGCTGTGGGAGTTTATCATTGAACCGTTCGAGAAGGTTATGTCCGACGAAGAATTGGAAAATATTCCAGAAGAAGTTGTTGAAGAATTAATATCGAGACTCCAGGTCGTGGAAGCTGTGGCGGGCAGTGTGGGGAGAGGGGAGAAGAGTATTACGAGGTTTAGAGAGAGCAAAGGCAAAGATAAAGATGCGAGAGATAAAAGCGCCAGTGATAGCGAAGCGGGCAGTTCCTCGATGGGTGGTAGTGGGCACGTGTGGGCGCGTGTCGTGAGCGGCGTGGGCGCGTGTGCAGCTTTATGTCGCGCACGTCACACAGCCCTACGGCACATGGCGGCCCGCGCGTTAGCCGCCATCGCACAGAGAGATCCCCATCCAGTCATGCGCGTCCTCGTCGATGAGTTGGTGTCGTCGTTGGAGCACCCCAGTCCCCGAGTTCGTTGCGGCGCGGCGGAAGCCCTGGCGCGTGTCGTGGACTCCTTGCAGCTGCATGTGGTGCCCTACATCGCTCTGCTGGTCGTGCCGCTGTTAGGGCGCATGAGCGACCACAACCCGTCAGTGCGCACGATGTCGACGCGGTGCTTCGCCACCCTCATCCAGCTGATGCCGCTGGATGGGCCGGCGGGGATTCCCCCCGACCTGTCCCCGAACATGCTGCAGCGGCGGAACAGAGACAAGACCTTCCTCGACCAACTCTTCAACCCCAAGTCCATCAAAGATTATAAGATACCCATACCGGTGACGGCCGAGCTCCGGAGCTACCAGCAGGCGGGTGTGAACTGGCTTCGCTTCTTGAACGAGTACAAGCTGCACGGCGTGCTGTGTGACGACATGGGTCTGGGGAAGACGCTGCAGTCGGTGGTGGTGGTCGCGGGGTCCCACTACGAGCGCGCTCAGAGCGGCGCCCCGCAGATGCCGTCGCTGGTGGTGTGTCCGCCCACGCTCACCGGTCACTGGGTGTTCGAAGTGACCAAGTTCATTCCGTGTCAATACCTGAAGCCGCTTCCCTACGTCGGTCCGCCGGTGGAGCGAGAGAGGTTGAGGGCCCAAGTGCCGTTCTACAATCTCATAGTGGCTTCCTACGACATAGTCAGGAAAGACATAGACTTCTTCAGCGGCATCAAGTGGAACTACTGCATCCTGGACGAGGGGCACGTCATCAAGAACGGGAAGACGAAAGCCTTCAAGGCCATCAAACAGATCGTCGCCAATCACAGGCTGATTCTGTCCGGAACGCCGATACAGAACAACGTGCTCGAGCTGTGGTCTCTGTTCGATTTCCTGATGCCAGGTCTGCTGGGGTCTGAGCGTCAGTTCACGGCGCGCTACTCGAGACCCCTGCTGGCCGCTCGGGACCCCCGCGCTTCGCCCCATCACCTGCAGGCTGGAGCACTCGCCTGCGAGGCGTTACATCGACAGGTCCTGCCATTCTTATTGCGACGTGTTAAAGAAGACGTGCTCCGGGAGTTGCCTCCTAAAATAACCCAGGACTACTACTGCGACCTGAGCCCGCTACAGCGGAGACTCTACGAACACTTCTCCAAAGAACACATGCCGCAGGAGGCCACTCACTCCCACACGCACGTGTTTCAGGCGCTGCATTATCTTCAGAACGTGTGTAACCATCCCAAGCTGGTGCTGGTGGAATCTCACCCGGAGACTAACCGCGTCACGCAGCAGTTGGCCGCCGCGGGCTCCTCGCTGGATGACATACAGCACGGGGCGAAGCTCCCGGCGCTCAAGCAATTGCTTCTCGACTGCGGCATTGGTAGCGCTTCAACCGGGGACGAGAGTGCAGTAGTGTCGCAGCACCGAGCGCTGATCTTCTGCCAGCTCAAGAAGATGCTGGACATCGTGGAAAAAGATTTGATCCAGAAACATCTGCCCTCCGTCAGCTACCTCCGCCTGGACGGCAGCGTGCCGCCCCACCAGAGGCACGCGATCGTGACACGGTTCAACACGGACGTGTCCATTGACGTACTACTTCTCACCACCGCTGTGGGCGGTCTGGGTCTTAATCTGACGGGGGCGGACACGGTCATATTCGTGGAACACGACTGGAACCCCATGAAGGACCTGCAGGCCATGGACCGCGCTCACCGCATCGGCCAGAAGAAGGTTGTTAATGTTTACAGACTGATAACGAGAGACACGCTGGAAGAAAAGATCATGGGATTGCAAAAATTCAAGCTGATGACCGCCAACACAGTAATCAGCAGTGAGAACGCGGCCTTAGAAACAATGGGCACAGATCAATTGTTGGATCTATTTCAGTCTCCTGGATCAGGACCGAGTTCAGGACCAACGGGAGCGCAGTGTTCTAAATCCCTCATAGAGAACCTCCCCGACCTCTGGGATGACAAACTGTATGAAGAAGAGTATGATATGACCAACTTTATTAAGAGCCTAAAGAAATAA

Protein sequence:

>DPOGS210817-PA
MRCSCLANTFNNCDVFDKRKTLKLNLNSENDFSSSSTYSQSDRFLHHPRLDRLFVLLEAGAGPATRRAAANQLGEVQKAHPEELHRLLARLMKHLRSPAWETRVAATQAVEAILSHVPEWLPPPCTAKEEDIDQDDSSRLRCETFDIERVLEHGAHLMGSEGHEYDLDEETLSATDMKDRLTKQRQNLNSRLGLDVAASLGVDLSGVYSNEDLCIQKTATTPTNRRPVQELVLSKGLSSREMNLAKRKARLAFSKQKSRDCEESPATTPTTPTTTIASIATIASIATTTMEPERKKIKLEQADDTILEVSGCAVPSRDGSWGEGHRWPLGAWSGALTVQLLSGAWEARHGAASAIRELLRAPIVRSAGWKAGMTAQQMEDSHQEWLEDMALRLLCVLALDRFGDFVSDQVVAPVRESSAQALGVCLSRLRAERVSLVARALGGLARHAQWEARHGALLAFKYLLAARREVAIECGLAARLWALLRDQDELAAPAHKYMALLAALMALPAAAAHLHPIDLADVLPRLWPYLDHSTSSVRKATLQTLRTITRPLVTTTSNGQNGGGNEKDDAQSNTDNSQGSDTRVEENGDSNNYLQWTPELLQDAMRHVYQRVLFEHVDDIQEIALQVWENLLHHADLGVILVASCPLLSLWMCLAMQPARLPLDPALLLHTPTKERSCRVRIGGVIQPNNEGEPRPSQKWYLGGSESQSAAIRDANVTKARCLAAELLGHLSCYLVQPAPGIEYKAEDESPIDCYVRVMLVYLRSGSALQRLVAGLVLASWARNSLRYGLYPPHLKHTETREDNLSVVSKLAPPSLVTALHTALNQTLYYDEVALNCNRILQEARDLLAMMKHYKLPVDNEEYNNILRLEQVSMLAAGCEAAVAGARSKRVAASLEERRVALLEAARAATREHAALAVSVQAALSSAVAQLAALPERLNPVVRPLMESIKKEASEELQRLSAGTLAALLAQLVHRQPCPNNKVLSNLKAFLSCDPEFTPRISLETTEEVNGDSGSGDSGGENEGPANQTPSLDKYEGILTLREQQRSAERLVTRRGRPPAHTPADLSLDQLFPHEDEARKLLRIQRRGATMALTSLTEYFGDDLPEKLPRLWEFIIEPFEKVMSDEELENIPEEVVEELISRLQVVEAVAGSVGRGEKSITRFRESKGKDKDARDKSASDSEAGSSSMGGSGHVWARVVSGVGACAALCRARHTALRHMAARALAAIAQRDPHPVMRVLVDELVSSLEHPSPRVRCGAAEALARVVDSLQLHVVPYIALLVVPLLGRMSDHNPSVRTMSTRCFATLIQLMPLDGPAGIPPDLSPNMLQRRNRDKTFLDQLFNPKSIKDYKIPIPVTAELRSYQQAGVNWLRFLNEYKLHGVLCDDMGLGKTLQSVVVVAGSHYERAQSGAPQMPSLVVCPPTLTGHWVFEVTKFIPCQYLKPLPYVGPPVERERLRAQVPFYNLIVASYDIVRKDIDFFSGIKWNYCILDEGHVIKNGKTKAFKAIKQIVANHRLILSGTPIQNNVLELWSLFDFLMPGLLGSERQFTARYSRPLLAARDPRASPHHLQAGALACEALHRQVLPFLLRRVKEDVLRELPPKITQDYYCDLSPLQRRLYEHFSKEHMPQEATHSHTHVFQALHYLQNVCNHPKLVLVESHPETNRVTQQLAAAGSSLDDIQHGAKLPALKQLLLDCGIGSASTGDESAVVSQHRALIFCQLKKMLDIVEKDLIQKHLPSVSYLRLDGSVPPHQRHAIVTRFNTDVSIDVLLLTTAVGGLGLNLTGADTVIFVEHDWNPMKDLQAMDRAHRIGQKKVVNVYRLITRDTLEEKIMGLQKFKLMTANTVISSENAALETMGTDQLLDLFQSPGSGPSSGPTGAQCSKSLIENLPDLWDDKLYEEEYDMTNFIKSLKK-