Monarch geneset OGS2.0

DPOGS206688
TranscriptDPOGS206688-TA1515 bp
ProteinDPOGS206688-PA504 aa
Genomic positionDPSCF300048 + 1256807-1262899
RNAseq coverage1042x (Rank: top 12%)
Annotation
HeliconiusHMEL0065796e-8255.67% 
BombyxBGIBMGA008521-TA0.068.60% 
DrosophilaDip-C-PA2e-14951.69% 
EBI UniRef50UniRef50_B4NJE82e-15352.24%GK12846 n=1 Tax=Drosophila willistoni RepID=B4NJE8_DROWI
NCBI RefSeqXP_001865452.13e-15654.01%xaa-pro dipeptidase [Culex quinquefasciatus]
NCBI nr blastpgi|1700596465e-15554.01%xaa-pro dipeptidase [Culex quinquefasciatus]
NCBI nr blastxgi|3454927263e-15053.74%PREDICTED: xaa-Pro dipeptidase-like isoform 3 [Nasonia vitripennis]
Group
Gene OntologyGO:00099874.6e-70cellular process
GO:00041771.8e-36aminopeptidase activity
GO:00301451.8e-36manganese ion binding
KEGG pathway 
InterPro domain[231-498] IPR0009944.6e-70Peptidase M24, structural domain
[15-137] IPR0078651.8e-36Peptidase M24B, X-Pro dipeptidase/aminopeptidase P N-terminal
Orthology groupMCL11536 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206688-TA
ATGGCTGGTGTTTGGTCTATGGGTCCTGGTACATATGAAGTTCCATTGTCCTTGTTTGCTAAGAATAGAGATAGACTTGCAGAAAAGTTGAAGAGTGGCCAAGTAGTTGTTCTGCAAGGTGGAGATGATATAAATCTCTATGATACTGACATCCAATATGTCTTCCGACAGGAAGCATATTTTACATGGGCTTGTGGCGTACGAGAGCCAGGCTGCTATTTTGCTCTTGATGTAAAAACCAAGAAAAGCATTGTCTTTGTGCCTCGTCTGCCAGATGAGTATGAAATTTGGATGGGCAAACTACTTAGTTGTCAAGATTACACCAATATGTATGGGGTTGATGAAGTCCGCTATGTTGATGAGATCTGTGATGTATTAAAATCACTCGAACCTGATACTTTGCTAACACTTGTAATGGACAATGAAACACTATTTCCGATTATTGCTGAACTGCGCGTCATCAAAACGCCAGAGGAGATAGAAGTAATGCGTTACATATGCAAAGTATCGTCCGATGCTCACAAACAGGAGTCTGCTCCTTCTAACAGTGAAAAAATGAGTACAGGGTTTGCTTCGGAAGAGAAATCGCCGGAGACATTGGCCACGACGCTCGACCTGTGGAAGTGGATACTCCCGAAAAGATTGCACATATCGAAAAGGAGGTTCTTAGAGAGCGCCGTTATGCTCTACGCTAAGCCCGGCCTTCTGGAGTATCAATGCGAATCAGTATTCCTCGATCATTGTTACCGTGTGGGCGGGTGTCGCCACGTGTCCTATACATGTATATGCGGCTCGGGTGACAATTCTGCCATTTTGCACTACGGACACGCCGCAGCTCCGAATAATAAGATGTTAAAGGATGGGGATATATGTTTATTCGACATGGGTGGCAACTATGCTGGGTACGCCGCAGACATCACATGCTCTTTCCCTGCTAATGGAAAGTTCACTGAAGATCAGAAGCTCATATATGAAGCTGTGCTCGCTGCAAGAGATGCGGTTATTAGACAAGGAAAACCGGGAGTCAAATGGACGGACATGCATCTAGCTGCGAATAGAGCCATGTTGGAACATCTCAAGAGAGGTGGACTCTTGAAGGGAGAAGTGGAGAAAATGATTGCGTTTGGTGTGAATGGCATCCTTCAACCTCATGGCCTCGGTCACTTGTTGGGTCTAGATGTGCATGATGTAGGGGGTTACCTCAAGCACTGCCCTCCCAGACCCAGCGGGCCCCTTGGAAGACTAAGAACTGCTCGGATCTTGGAAGCCGGCATGATCCTCACTATTGAACCCGGATGTTACTTCATACCAAAGTTGTTGGATGCAGCTAAACGTACCCAGAAACTAGCGCAGTTCTTTAACTGGGATGTAATGGATAGATTCAGAGGCTTTGGCGGAGTTCGCATAGAAGACGACGTGCTCATCACAGACAAGGGCGTCGAAAATCTCACATTCGTGCCAAGAACTGTTGCGGAAATAGAAGAGTTCATGGCCAATGGCGCAAACTTCAAGTAA

Protein sequence:

>DPOGS206688-PA
MAGVWSMGPGTYEVPLSLFAKNRDRLAEKLKSGQVVVLQGGDDINLYDTDIQYVFRQEAYFTWACGVREPGCYFALDVKTKKSIVFVPRLPDEYEIWMGKLLSCQDYTNMYGVDEVRYVDEICDVLKSLEPDTLLTLVMDNETLFPIIAELRVIKTPEEIEVMRYICKVSSDAHKQESAPSNSEKMSTGFASEEKSPETLATTLDLWKWILPKRLHISKRRFLESAVMLYAKPGLLEYQCESVFLDHCYRVGGCRHVSYTCICGSGDNSAILHYGHAAAPNNKMLKDGDICLFDMGGNYAGYAADITCSFPANGKFTEDQKLIYEAVLAARDAVIRQGKPGVKWTDMHLAANRAMLEHLKRGGLLKGEVEKMIAFGVNGILQPHGLGHLLGLDVHDVGGYLKHCPPRPSGPLGRLRTARILEAGMILTIEPGCYFIPKLLDAAKRTQKLAQFFNWDVMDRFRGFGGVRIEDDVLITDKGVENLTFVPRTVAEIEEFMANGANFK-