Monarch geneset OGS2.0

DPOGS206487
TranscriptDPOGS206487-TA1122 bp
ProteinDPOGS206487-PA373 aa
Genomic positionDPSCF300381 - 44821-47205
RNAseq coverage651x (Rank: top 20%)
Annotation
HeliconiusHMEL0054572e-14080.95% 
BombyxBGIBMGA002394-TA0.088.74% 
DrosophilaCG13630-PA4e-16373.58% 
EBI UniRef50UniRef50_Q4QRK03e-15468.78%Methionine aminopeptidase 1 n=168 Tax=root RepID=AMPM1_DANRE
NCBI RefSeqXP_967283.13e-18079.25%PREDICTED: similar to methionine aminopeptidase [Tribolium castaneum]
NCBI nr blastpgi|910922625e-17979.25%PREDICTED: similar to methionine aminopeptidase [Tribolium castaneum]
NCBI nr blastxgi|910922620.079.25%PREDICTED: similar to methionine aminopeptidase [Tribolium castaneum]
Group
Gene OntologyGO:00099872.5e-108cellular process
GO:00065081.2e-89proteolysis
GO:00041771.2e-89aminopeptidase activity
GO:00082351.2e-89metalloexopeptidase activity
KEGG pathway 
InterPro domain[63-358] IPR0009942.5e-108Peptidase M24, structural domain
[112-358] IPR0024671.2e-89Peptidase M24A, methionine aminopeptidase, subfamily 1
[177-190] IPR0017145.8e-18Peptidase M24, methionine aminopeptidase
Orthology groupMCL12907 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206487-TA
ATGTGCGAAACCCCCGGCTGTAAGTCAATTGCACAACTACAGTGTCCAACATGTATAAAACTTGGGGTGCAGGGATCTTTTTTCTGCAATCAGGAATGTTTCAAGAAGTCGTGGAAGTCACACAAACTAATCCATTCCTTGGCGAAAGGTGAAAAGACAGATGTATCTGGTATTGAGTTCAATCCTTGGCCATCTTACAACTTCACGGGGAAGTTGAGGCCGTTTCCTCCGGGTCCAAAGCGCACTGTTCCCAGTCATATAGGGCGGCCTGACTATGCCGACCATCCAACCGGGTTCCCGGCGTCGGAAAATGCCGCAAAAGGTTCCGGACAAATCAAAGTGCTAGATGACGAAGAGATAGAAGGAATGAGAGTTGCCTGTCGCCTTGGACGAGAAGTTTTAGACGAAGCTGCAAAAGTTTGCGATGTTGGTGTCACAACTGATGAAATTGACAGAGTTGTACATGAAGCATGTATTGAAAGAGAATGCTATCCAAGCCCTCTAAATTATCACAATTTCCCAAACAGCTGTTGTACTTCAGTAAATGAGGTCATATGCCATGGAATTCCTGACTTGCGTCCGTTGGAGGATGGTGATTTATGCAATGTAGATGTAACAGTCTATCACAGGGGCTTCCATGGTGACTTAAATGAAACCTTTTTTGTTGGTAATGTACCGGAAACATCTCGAAAACTGGTACAAGTAACTTATGAATGTTTACAAAAAGCTATAGAAATTGTCAAGCCCGGAGAGAAGTACAGAGAAATTGGTAATGTTATTCAGAAACATGCCCAAGCGAATGGCTTTAGTGTGGTAAGGTCATATTGTGGGCACGGTATACATAGATTGTTCCATACAGCACCGAATGTGCCTCATTATGCAAAGAATAAAGCTGTGGGAGTAATGAAACCAGGGCACTGTTTCACTATTGAGCCGATGATTAACGAGGGGGCCTGGAGGGACGAACAGTGGCCGGACAATTGGACAGCAGTTACAGCCGATGGATCAAGATCTGCTCAGTTTGAACAAACTCTCTTGGTAACTGAGACTGGGTGTGACATACTAACAAAACGAGGCGTCGGCTATCCGTGGTTCATGGACCAATTAAAAAAACTACAATAG

Protein sequence:

>DPOGS206487-PA
MCETPGCKSIAQLQCPTCIKLGVQGSFFCNQECFKKSWKSHKLIHSLAKGEKTDVSGIEFNPWPSYNFTGKLRPFPPGPKRTVPSHIGRPDYADHPTGFPASENAAKGSGQIKVLDDEEIEGMRVACRLGREVLDEAAKVCDVGVTTDEIDRVVHEACIERECYPSPLNYHNFPNSCCTSVNEVICHGIPDLRPLEDGDLCNVDVTVYHRGFHGDLNETFFVGNVPETSRKLVQVTYECLQKAIEIVKPGEKYREIGNVIQKHAQANGFSVVRSYCGHGIHRLFHTAPNVPHYAKNKAVGVMKPGHCFTIEPMINEGAWRDEQWPDNWTAVTADGSRSAQFEQTLLVTETGCDILTKRGVGYPWFMDQLKKLQ-