Monarch geneset OGS2.0

DPOGS201356
TranscriptDPOGS201356-TA3375 bp
ProteinDPOGS201356-PA1124 aa
Genomic positionDPSCF300083 - 519525-529778
RNAseq coverage316x (Rank: top 36%)
Annotation
HeliconiusHMEL0147110.068.07% 
BombyxBGIBMGA002149-TA0.061.66% 
DrosophilaCG32649-PA2e-15645.95% 
EBI UniRef50UniRef50_E2C5315e-17964.38%Uncharacterized aarF domain-containing protein kinase 4 n=2 Tax=Formicidae RepID=E2C531_HARSA
NCBI RefSeqXP_001605712.10.065.92%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3838658010.051.32%PREDICTED: chaperone activity of bc1 complex-like, mitochondrial-like [Megachile rotundata]
NCBI nr blastxgi|3838658010.051.78%PREDICTED: chaperone activity of bc1 complex-like, mitochondrial-like [Megachile rotundata]
Group
Gene OntologyGO:00167723.2e-10transferase activity, transferring phosphorus-containing groups
KEGG pathway 
InterPro domain[798-913] IPR0041471.5e-33ABC-1
[808-997] IPR0110093.2e-10Protein kinase-like domain
Orthology groupMCL13379 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201356-TA
ATGATGCAATTAATCTTAGACTGCGTATTTTCACCACGTTTGTACAGAATATATGGACAAGAGGACGGCGATGGTGCCTATGAACCTAATATGATGGAACAAGTCGCTACTAAACTACTCTCAGCGACTAGAACTATTGTAAACATCGGTATATATACATCGCCATTGTTGTGTATGTATATTTTTAAAAGAGGCTTTTTCTCCGTTGACGAGCTGAACACGCTCTTCCATCTCATCGGAACTGTCGGATGTTTCTTTTCTTTGACCCTGTTCATGAGATCACTAGGGAGAGCTTCCAATCCCGACTACGTGGAATTTTTGCATACTCTCTACCGGCCCGTTGTTGACGAGAAGAGTTACATTGAAAGCATCCGAAAATATGATTTCGATTTCAATCGTTGGCCAATTTCTTTTATCATGGACCCGATTGCCAGTGAACCTGCTGTCAATCCATTCTCCAAATGTGCTAATCCAGAACTACCATATTACAAGAGAGTTACCATACAAATACTAGCCTACTTCGCCATTAAAACTTTCGCCATACGCCTCATATACCCTGGTTGTGTTGGGGTGCTCCGGAACATACTATGGACGGCGCTAGAAGAGGGTAGACTGAACCTCGTCGAGGTTTATAATGGTAAAAGGGCCAGAGTTGTGACAGCTGATGGAAATACTATCGATACGATGTTTGTCGACAACCGATTGACTTCTTTGAATGGGAGGATCCTTGTGATCTGCAGTGAGGGTAACTCTGGCTTTTATGAAGTTGGCATAATGGTAACGCCAGCTAAAGCCGGTTTCTCGTCCTTGGGGTGGAACCATCCAGGATTTGGCGACAGTACGGGATATCCGTTTCCACGTCAAGAACAAAACGCCATTGATGCTGTAGTCCAATACGCTATTCACGAACTTGACTTCAACGTTGAGAATATTGTCATGTTTGGTTGGAGCATTGGAGGCTATGTATCCGCCTGGGCCGCTGCTTCATATCCTGATATGAGGGGTCTTATCTTGGATGCTTCATTTGACGACCTCTTGCCCCTGGCTCTCAAACAAATGCCCAGGTCATGGAATTATCTTGTGAAGGAAGTTATTCGTTCTTATGTTGACCTTAACGTTGGCGATTTGCTTGTGAGATATCGTGGCGCCGTACAGTTGGTTCGACGTACTGAAGATGAAGTTATTTGCATACGCCAAGGCCAGCTTGCTACCAACCGCGGAAATTACCTGTTCCTGAGACTTGTAGTGAGTCGTCACCCGGAATTTTTCGAGGAACTAAACGGAGAGAAGCCAGTGATTGATCTACTTCAGGCTTGCGTTGCTCTGTCTGATCAACAACGCATCGTGCTCTCTAGGACAGATTTACCTGAGTCAAAGAGGAGGCTCCTTAGGATGATTGAGAAGTACATGCGTGACTACCGATCATCGCACTGCAGTGTGTTGCCAGAATCAGACTTCAAAGCAGCTATGACACACATTAATGACTTCATAGGTGTGGTGAGAGGTCTCCGGCAGGTACTAGAAGCTGGTATTAAAATCCAACAAGAAAATTCTAGACTAATATGGAATAACTCTAGTTTTCGGCCTTCTCTGCAGTCTTGTCCTACTAATGCTCTATCTTATAAGCCTAGTGCTGATATGTCTAGCGATGTTTTTGATAGGGCCATGGTTGTTATTCATGGTGTCAAAGAATATGTCACTATGTATAGAACTAATCCAATTAATAATGTACATAGTGCATCTAGTATGGATCCACAACTGCAAGAAGAAATTGAATTGCTCAACAAAGAGTTCAATGAGACTTTTGAGAATTTAAAACAGACTCAAAAGAAAATAGTTTCTACTACAATAACATCACCCTCTGAACAAGTTTTGAAACCAATTGATAAAGTTGAAGAAGTTGCAAGACCTGTAATAAGACCAGAAGCTTCAAAACATTCAAAAGTTCCTTCTGTAGAAAAAGTTGTACCAGTCGCAGAGGCAAGCAGTTTGTCTATTCCAAAGCCTGTAGCTAAAAAGAAGATGAAGGTTTCTCTTAGTGAAAACTCAAAAGCTCGTGTGGTTCCCTCTTCTCGTATAGGCAGGATGATGTCTTTCGGGTCGCTGGCCGCTGGGCTTGGTGTGGGTACAGTAGCACAGTATGCAAGAAACACTCTACAGTCCATGACAGGAAAAACGGATGATTCAGCAAATGTTTTCCTGTCACCCGCCAATGCAGAACGCATTGTGGATACCCTGTGTAAAGTCAGAGGTGCAGCTTTGAAGTTGGGACAACTGTTGAGCATCCAGGATGAGTCTGTTATACCATCGGATCTGCAGCGTATATTTGACCGGGTTCGTCAGTCAGCTGACTTCATGCCGGTATGGCAAGTGGAGAAGGTCATGAGTTCTCAACTGGGGACCGATTGGAGGACTAAGATACAACACTTTGAAGAGCAGCCCTTTGCTGCAGCTTCCATTGGGCAAGTGCATCTAGGGGTGCTACATAATGGTCAAGAGGTGGCCATTAAAGTACAGTATCCTGGTGTTGCCCAAGGAATCAACAGTGATATTGACAACCTCGTTGGAGTGTTGAAAGTGTGGAATATGTTCCCAAAAGGAATGTTCATAGACAATGTAGTGGAAGTCGCAAAGAAGGAACTTGCATGGGAAGTGGATTACAGAAGGGAGGCAGAATGCACTAAGAAATTTAAACAACTGCTTTCTTCATACAATGAGTACTTTGTACCTGCGGTTATAGACGAACTCTGTGCCCAGGAGGTAATAACAACAGAGTTGATAGATGGTACACCCCTCGACAAACTCTTTGATGCTGACTATCACGTTAGATATGACATCGCTTACAAGATCATGCAGCTCTGTCTGCGTGAAATGTTTGTGTTGAGATGTATGCAGACAGATCCGAATTGGGCTAATTTCTTTTACAACACAAACACCAAGCAGGTAATTCTTTTGGACTTTGGTGCAACTAGAGAATATTCGAAAGACTTCATGGACCAATACATTCAAATAATTAAAGCTGCTTCTATGGGTGATCGCGCCGCCATATTGAAGAAGTCTTTGGAAATGAAATTCCTTACAGGATATGAGTCTAAGATAATGGAAGAAACGCACGTGGATATGGTAATGATAATGGGCGAGGTTTTCACTATGGAAGGCGAAGAATTTGATTTTGGCACGCAAAAAACTACACGACGTATACAGAGTTTAGTGCCCACAGTGCTAACTCACAGACTGTGCCCACCGCCGGAGGAAATATATTCGCTGCATAGAAAACTGTCCGGAGTATTCTTACTCTGTTCAAAACTCAAAGTCAAAATGAACTGTAGGGACATGTTTAATGAAATCTATGATCAATATCAGTTCTCGACGTAA

Protein sequence:

>DPOGS201356-PA
MMQLILDCVFSPRLYRIYGQEDGDGAYEPNMMEQVATKLLSATRTIVNIGIYTSPLLCMYIFKRGFFSVDELNTLFHLIGTVGCFFSLTLFMRSLGRASNPDYVEFLHTLYRPVVDEKSYIESIRKYDFDFNRWPISFIMDPIASEPAVNPFSKCANPELPYYKRVTIQILAYFAIKTFAIRLIYPGCVGVLRNILWTALEEGRLNLVEVYNGKRARVVTADGNTIDTMFVDNRLTSLNGRILVICSEGNSGFYEVGIMVTPAKAGFSSLGWNHPGFGDSTGYPFPRQEQNAIDAVVQYAIHELDFNVENIVMFGWSIGGYVSAWAAASYPDMRGLILDASFDDLLPLALKQMPRSWNYLVKEVIRSYVDLNVGDLLVRYRGAVQLVRRTEDEVICIRQGQLATNRGNYLFLRLVVSRHPEFFEELNGEKPVIDLLQACVALSDQQRIVLSRTDLPESKRRLLRMIEKYMRDYRSSHCSVLPESDFKAAMTHINDFIGVVRGLRQVLEAGIKIQQENSRLIWNNSSFRPSLQSCPTNALSYKPSADMSSDVFDRAMVVIHGVKEYVTMYRTNPINNVHSASSMDPQLQEEIELLNKEFNETFENLKQTQKKIVSTTITSPSEQVLKPIDKVEEVARPVIRPEASKHSKVPSVEKVVPVAEASSLSIPKPVAKKKMKVSLSENSKARVVPSSRIGRMMSFGSLAAGLGVGTVAQYARNTLQSMTGKTDDSANVFLSPANAERIVDTLCKVRGAALKLGQLLSIQDESVIPSDLQRIFDRVRQSADFMPVWQVEKVMSSQLGTDWRTKIQHFEEQPFAAASIGQVHLGVLHNGQEVAIKVQYPGVAQGINSDIDNLVGVLKVWNMFPKGMFIDNVVEVAKKELAWEVDYRREAECTKKFKQLLSSYNEYFVPAVIDELCAQEVITTELIDGTPLDKLFDADYHVRYDIAYKIMQLCLREMFVLRCMQTDPNWANFFYNTNTKQVILLDFGATREYSKDFMDQYIQIIKAASMGDRAAILKKSLEMKFLTGYESKIMEETHVDMVMIMGEVFTMEGEEFDFGTQKTTRRIQSLVPTVLTHRLCPPPEEIYSLHRKLSGVFLLCSKLKVKMNCRDMFNEIYDQYQFST-