Monarch geneset OGS2.0

DPOGS213708
TranscriptDPOGS213708-TA1395 bp
ProteinDPOGS213708-PA464 aa
Genomic positionDPSCF300219 + 599286-603200
RNAseq coverage113x (Rank: top 59%)
Annotation
HeliconiusHMEL0048223e-11471.86% 
BombyxBGIBMGA010679-TA2e-7376.54% 
DrosophilaCG8773-PB6e-5442.80% 
EBI UniRef50UniRef50_B4NKB84e-4842.91%GK13934 n=5 Tax=Drosophila RepID=B4NKB8_DROWI
NCBI RefSeqXP_002056091.14e-5445.08%GJ10415 [Drosophila virilis]
NCBI nr blastpgi|1953949288e-5345.08%GJ10415 [Drosophila virilis]
NCBI nr blastxgi|1953949289e-5144.32%GJ10415 [Drosophila virilis]
Group
Gene OntologyGO:00065084.7e-79proteolysis
GO:00082372.2e-58metallopeptidase activity
GO:00082702.2e-58zinc ion binding
KEGG pathwaydpo:Dpse_GA213105e-53 
 K11141 (ENPEP)maps-> Renin-angiotensin system
InterPro domain[20-461] IPR0019304.7e-79Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
[213-461] IPR0147822.2e-58Peptidase M1, membrane alanine aminopeptidase, N-terminal
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213708-TA
ATGATGATTTCCCCAATAATAAAATGGATTTGTAATCATAAACTTTTAGCATTCTTGGGTGTAGCGACGTTTTGTTTTTTTGTATCTACAGTTGTTCTTTCTGTACAAAATAGCAAATTGAAAGAGTCCTCTCACATTTCCTCCGAAATGGATAACCGATATAGGCTCCCAGATTACATAAAGCCTACAAATTATCAGTTAAAATTGAGTCCAAACATAGCACAAAAAACTTTTGATGGGATAGTTGCTATCTCACTGTACATTACAAAACCAATCAAGACTATCACATTACACACCAAAGATTTGGAAATAAAATCAGTAGATTTTAAAAACAATTTTAATCAAAGTATAGAAGTGTCTTCTTCAAATATAATAGAAATCGCAGAGGTACTACAAGTCAATCTACAGAAGGAAGTAGTGCCAAATACGAATTATAAGCTAGAAATAGAATTTTCTGGCAGATTAGACAAAGGAATCGTTGGTTTTTATTCGAGCACAATGAGGAACCGAGAATCATTTCACGGTTCATTGAAATTGAAACCTGTTCGGTCTCTAATAATTTCTCGCTTTATATCCAATACTTCCATCGATATAATGGAGAATTCGTATAAAGTTAAACTCTCCGAGAAACTTTCCTCTTTGGTTCGACCAGCGCATTACAAACTATTGTTAAATCCGAATCTTAAAACTGGAACTTTCTCAGGAGAGGTTGAGATAAATGTTGTTGTTAAAGAGACGAGGAATTTTATAGCCCTCCACTCAAAATTTTTGGAAGTAAATGACGTAAAGGTAAACAAAAATCGGGAAGAAGTTTCTGTTTCAAAATTTTTGGAAGTTACGTCTTTGGAACAACTTTTGATTCAATTTGACAACAACCTTCCTCCTGGAAATTGTGATATAAGTATCAAATTTAATGGAAATTTAACTCGGAACATTGTTGGTTTTTATTTGTCCCATTTAAAAGACAAGAGTACAATGGTTGCTAGTAAGTTCCAGCCAACTTATGCTCGACAAGCTTTCCCCTGTTTCGATGAGCCAGAATACAAAGCAACATATGACATAACATTAGTCAAACCCAAGGAGTACATCGCCCTGTCTAATATGAATGAAATATCAAGGTCCTTAGCGAATTCTTCAGACTCCGAGGCAGTCACCTTTGCAACCAGTGTTCCGATGTCGACATACTTAGCATGTTTTGTTGTTTGCAATTTTGATTATAAGGAGGTCGATGTTAATGCAAACGGTATAGGAAGTAACTTTAAGTTGCGAGCCTTTGCTCAGAAAGATCAGACGCATAAAATAGATTTCGCTCATGACATTGGGAAACGTGCCACAGAATTTTATATCAATTATTATGAAGTTCCCTTTCCACTTCCAAAGCTGGGTAAGTGTTGA

Protein sequence:

>DPOGS213708-PA
MMISPIIKWICNHKLLAFLGVATFCFFVSTVVLSVQNSKLKESSHISSEMDNRYRLPDYIKPTNYQLKLSPNIAQKTFDGIVAISLYITKPIKTITLHTKDLEIKSVDFKNNFNQSIEVSSSNIIEIAEVLQVNLQKEVVPNTNYKLEIEFSGRLDKGIVGFYSSTMRNRESFHGSLKLKPVRSLIISRFISNTSIDIMENSYKVKLSEKLSSLVRPAHYKLLLNPNLKTGTFSGEVEINVVVKETRNFIALHSKFLEVNDVKVNKNREEVSVSKFLEVTSLEQLLIQFDNNLPPGNCDISIKFNGNLTRNIVGFYLSHLKDKSTMVASKFQPTYARQAFPCFDEPEYKATYDITLVKPKEYIALSNMNEISRSLANSSDSEAVTFATSVPMSTYLACFVVCNFDYKEVDVNANGIGSNFKLRAFAQKDQTHKIDFAHDIGKRATEFYINYYEVPFPLPKLGKC-