Monarch geneset OGS2.0

DPOGS212862
TranscriptDPOGS212862-TA2322 bp
ProteinDPOGS212862-PA773 aa
Genomic positionDPSCF300086 + 361303-374228
RNAseq coverage1831x (Rank: top 7%)
Annotation
HeliconiusHMEL0081910.088.34% 
BombyxBGIBMGA000806-TA0.084.67% 
DrosophilaCG32626-PH0.068.21% 
EBI UniRef50UniRef50_F5HJE80.075.51%AGAP000577-PC n=60 Tax=Bilateria RepID=F5HJE8_ANOGA
NCBI RefSeqXP_623550.10.075.98%PREDICTED: similar to CG32626-PA, isoform A isoform 2 [Apis mellifera]
NCBI nr blastpgi|3838477910.074.60%PREDICTED: AMP deaminase 2 isoform 2 [Megachile rotundata]
NCBI nr blastxgi|3838477910.075.00%PREDICTED: AMP deaminase 2 isoform 2 [Megachile rotundata]
Group
Gene OntologyGO:00168140hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in cyclic amidines
GO:00061440purine base metabolic process
GO:00038768e-287AMP deaminase activity
GO:00091688e-287purine ribonucleoside monophosphate biosynthetic process
GO:00192393.8e-111deaminase activity
KEGG pathwaynvi:1001235580.0 
 K01490 (E3.5.4.6, AMPD)maps-> Purine metabolism
InterPro domain[1-753] IPR0162970AMP deaminase, metazoa
[127-739] IPR0063298e-287AMP deaminase
[297-703] IPR0013653.8e-111Adenosine/AMP deaminase
Orthology groupMCL10280 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212862-TA
ATGTATAGCTTCGACAAGGATCTGTGGCACGTGAGAGCGCGCCCGACAGGCTCCCAGGATGATACCAAGGAAGAAAATCGTAGTGAAAGTCCAACGAGCGCGGTGGGGGCGGAATCGCCGCGGGAGCTCCCAAATGAGCTCTCAGCCCCTTACGAAGTCCCTCAGTTCCCCATAGAACAGATAGAGAAGAAGCTCCTCATACAACGACAGTTAAATGTCAAGGCAGCAGAATGTGGTCAGTCAGCACGCTCCCTGGCCGGAGACGCCGTGCTGGAGGAGGCGGCGAAGCTTCGGGCCACTGACGATGACTTCGACGTCGTCCTACCACACTTCCAACGGGTGGCCATCAGCGGAGAAGACACGTCGGGGGTACCGCTGGAGGATCTCCAGCAGGCGTCCTCGTACCTGGTCCAAGCTCTAGAAATCCGTAAGCGCTACATGGACATCTCCCAGCAAAGTTTCTGCAGCATCACGGCTCGGTTCCTCCGTAGCATGGACAGCGAGGCTGCCGCCAACCACAAGCCCAGTGTGTCCGTGCAAAAACATATCGCTGATCACATGGTCCATCCGCCATTCAAAGCTAACAAGGATCCGTGGGAGGGGCCCACTCCTTCCGCCAAGGATTACACCATCAAGGCGGACGACGGGGTGTTCAACCTGTACCGGCAGACCGAGGCCGGCGAGGAGCGGGTGCCCTACGAGTACGTCAAGCTGCCGCAGTTCATACAGGACAAGAACACCATGTGTACCATGATAGCTGACGGACCGCTTAAGTCGTTCTGCTACCGTCGCCTCAGCTACCTGTCGTCCAAGTTCCAGCTGCACGTGTTACTGAACGAACTGCGCGAGCTGGCCTCGCAGAAGGCGGTGCCGCACAGGGACTTCTACAACATACGGAAAGTGGACACACACATCCACGCGGCCTCCTGTATGAACCAGAAGCACCTTCTGCGGTTCATTAAGAAGACGCTCAAGAAACACGCTGATGAGGTGGTGACGCTCCACAAGGGCTCCCCTATGACCCTGAAGGCCGTCTTCCAGTCCATGAACCTCAGCACCTACGACCTCACCGTAGACATGCTGGACGTACACGCGGACCGAAACACCTTCCATCGATTCGACAAGTTCAACGCCAAGTACAACCCCATCGGAGAGTCGCGGCTGAGGGAGGTGTTCCTCAAGACCGACAACCACATGAACGGGAAGTACTTCGCACGGATTATTAAGGAGGTGGCGTCAGATCTGGAGGAGAGTAAGTACCAGAACGCCGAACTCCGTCTCTCCATCTACGGCAAGAGTCCCGGCGAGTGGGCCAAGCTGGCCAAGTGGGCCATCCACTATGACGTGCACTCAGACAACGTCCGCTGGCTCATACAGATACCGCGACTATACGACATCTTCAAGTCGAACAAGATAATGAACAACTTCCAGGAGTTCCTGAGCAACATCTTTCTCCCCCTGTTCGAGGTGACCCGGGACCCCAACAGTAACATCGAGCTGCACAAGTTCCTCACCCACGTGGTCGGCTTCGACAGTGTGGACGACGAGTCCAAGCCCGAGAACCCGATGCTGGAGGCGGACGTGCGCGAACCGCGGGCCTGGGCCGACGACGAGAACCCGCCCTACGCCTACTACCTGTACTACATGTACGCTAACATCACCGTGCTCAACCACTTCCGGAAGGAGCAGGGTCTGAACACGTTCGTGCTCCGTCCTCACTGCGGGGAGGCCGGGCCCGTGCAGCACCTGGTGTGCGGCTTCATGCTGGCCGAGAACATCTCGCACGGACTGCTGCTGAGGAAGGTGCCGGTGTTGCAGTACCTGTACTACCTGTCCCAGATCTCCATCGCCATGTCGCCCCTCAGCAACAACTCCCTTTTCCTCAACTACCACCGCAACCCCTTGCCGGAGTTCCTCGCCCGCGGTCTCTGCGTCACCCTCAGCACGGACGACCCGCTGCAGTTCCACTTCACCAAGGAACCCCTGATGGAGGAGTACAGTATAGCTGCCCAGGTGTGGAAGCTGAGCTCCTGTGACATGTGCGAGCTCGCCCGGAACTCCGTGCTCATGTCGGGATTCCCTCATGAGATGAAGCAGTACTGGCTCGGTCCCAACTACACCAAGGAGGGAGTGGCCGGCAACGACATCACACGCACCAACGTCCCGGACATCCGCATCTCCTTCCGCTACGAGACCCTGCTGGACGAGCTCACTAACATGTTCAAGTTCCGTCACAATCCACAACAGAACGGCGTCCGTCCGCCGACGCCTCCTACACCCGACATGAAGGTGGTTAGCCTCAAGAGACCCTCGCTTGTTTAA

Protein sequence:

>DPOGS212862-PA
MYSFDKDLWHVRARPTGSQDDTKEENRSESPTSAVGAESPRELPNELSAPYEVPQFPIEQIEKKLLIQRQLNVKAAECGQSARSLAGDAVLEEAAKLRATDDDFDVVLPHFQRVAISGEDTSGVPLEDLQQASSYLVQALEIRKRYMDISQQSFCSITARFLRSMDSEAAANHKPSVSVQKHIADHMVHPPFKANKDPWEGPTPSAKDYTIKADDGVFNLYRQTEAGEERVPYEYVKLPQFIQDKNTMCTMIADGPLKSFCYRRLSYLSSKFQLHVLLNELRELASQKAVPHRDFYNIRKVDTHIHAASCMNQKHLLRFIKKTLKKHADEVVTLHKGSPMTLKAVFQSMNLSTYDLTVDMLDVHADRNTFHRFDKFNAKYNPIGESRLREVFLKTDNHMNGKYFARIIKEVASDLEESKYQNAELRLSIYGKSPGEWAKLAKWAIHYDVHSDNVRWLIQIPRLYDIFKSNKIMNNFQEFLSNIFLPLFEVTRDPNSNIELHKFLTHVVGFDSVDDESKPENPMLEADVREPRAWADDENPPYAYYLYYMYANITVLNHFRKEQGLNTFVLRPHCGEAGPVQHLVCGFMLAENISHGLLLRKVPVLQYLYYLSQISIAMSPLSNNSLFLNYHRNPLPEFLARGLCVTLSTDDPLQFHFTKEPLMEEYSIAAQVWKLSSCDMCELARNSVLMSGFPHEMKQYWLGPNYTKEGVAGNDITRTNVPDIRISFRYETLLDELTNMFKFRHNPQQNGVRPPTPPTPDMKVVSLKRPSLV-