Monarch geneset OGS2.0

DPOGS214936
TranscriptDPOGS214936-TA1377 bp
ProteinDPOGS214936-PA458 aa
Genomic positionDPSCF300280 - 184017-186687
RNAseq coverage7x (Rank: top 87%)
Annotation
HeliconiusHMEL0034061e-1441.18% 
BombyxBGIBMGA001639-TA1e-3856.45% 
DrosophilaApepP-PA4e-2041.46% 
EBI UniRef50UniRef50_F2YHL86e-3253.23%Aminopeptidase P-like protein (Fragment) n=1 Tax=Ostrinia nubilalis RepID=F2YHL8_OSTNU
NCBI RefSeqXP_560264.32e-2043.09%AGAP001037-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3264544822e-3153.23%aminopeptidase P-like protein [Ostrinia nubilalis]
NCBI nr blastxgi|3264544825e-3153.23%aminopeptidase P-like protein [Ostrinia nubilalis]
Group
Gene OntologyGO:00167871.1e-10hydrolase activity
KEGG pathway 
InterPro domain[331-429] IPR0005871.1e-10Creatinase
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214936-TA
ATGGCGGGTTATTCTAACAACGGCCGCGCTCACATGTACGTTATCACGCACACACGCACACGGTCCCAGAGACCGGATAAAAGTCAATTACTACAGGAATGTATGACATTGCGCAATATTACTGACGACGAAACTATCAGTTCGACGTTGACGTATAATACGACAGCTGGAGTCACTGAAAACTTACACGTAAAGAATTCTAGCTCTGCAAATGTACGAAAAAATCAAATTGAGTCTAATAACAGTGTCTCATTAGCAACGTATGATGCAAATATTAAGACGAATAATGATGCTATACAGTATTACGATGGGAAACCGAAATTAGACAGTGTGTTGAAATTCTGGAACGATATGAAATTCGAGCATAAGATAACAACGGAAACCGTAAGAGAAAAGTACGATCCAGAAATATCTTACGTTCCAGAGACGAAAAACAAACGTGACGTAGACGTTGAGCCCGATGAAGAGGATGATATAAAGAAGTTACTTGACAACTACTTTAAAAATAAAAAGATTAAAAGCGGAAAACACGACACAGAGGCCACGACACAAAGAGGTTTCATTGACGACTATAGAGGCTATGCAAAGTCAAGACTTAGTAACAAGAGAGCTGATGAATTTGGACCGGATCATAATCTGCCATATTTGTTGCCTTTTGGCGGTCAAAAACACTCTAGAAGTCCACTTAGGATTATCTCTGATAAGGATGAGAGTGGATCCCACCGTGATTATGAAGACAACAGTAAATATACCGACGACAACAGAAAAATATACCAGAATCCGTACTCGAGGAGACTAAAAGACAAGGAGAGATTTGTCGTCACTACGGAGAGACAGAGAGAAAAGAATTTCTCAAAAGACATGATAAAAGAAATAGCAGACAGTGTTAAGGAACTTGTTCTTAGAGATCTAAAACTGAAACTTCAAGAGACTACAGCGAGGACTACGACTCAAACTACACGACCAAGATTTAAGGCAAGATTTTTTTTTCTGTCGGGGTCCGCCGGTACAGCCCTCGTAACAGCCGACCACGCCTTACTGTGGACTGACGGCAGATACTTCACGCAATTCGATATGCAAGTTGATCCTCGTATTTGGACTCTCATGAGGATCGGTACTGATGTAACGATCGAGAGTTGGCTAGCGTCTAACATGACGAGAGGTTCAAGAGTTGGTATTGATCCAACGACCTACACACGCAGTTCTTGGACAACTTTGGAGAATGCTGTGCGTAACGCAAACATAAGTATTGTACCAATTTACGATAACACGGTGGACGAGGCTAGAAGAAGGGTTTCGGACCCCCCTCCCGCCAGGCCTAACGAGCCGTTGTTGGCGCTCACAGTAAATTTCACGGGTAAGTTACATATAAGTTAA

Protein sequence:

>DPOGS214936-PA
MAGYSNNGRAHMYVITHTRTRSQRPDKSQLLQECMTLRNITDDETISSTLTYNTTAGVTENLHVKNSSSANVRKNQIESNNSVSLATYDANIKTNNDAIQYYDGKPKLDSVLKFWNDMKFEHKITTETVREKYDPEISYVPETKNKRDVDVEPDEEDDIKKLLDNYFKNKKIKSGKHDTEATTQRGFIDDYRGYAKSRLSNKRADEFGPDHNLPYLLPFGGQKHSRSPLRIISDKDESGSHRDYEDNSKYTDDNRKIYQNPYSRRLKDKERFVVTTERQREKNFSKDMIKEIADSVKELVLRDLKLKLQETTARTTTQTTRPRFKARFFFLSGSAGTALVTADHALLWTDGRYFTQFDMQVDPRIWTLMRIGTDVTIESWLASNMTRGSRVGIDPTTYTRSSWTTLENAVRNANISIVPIYDNTVDEARRRVSDPPPARPNEPLLALTVNFTGKLHIS-