Monarch geneset OGS2.0

DPOGS209790
TranscriptDPOGS209790-TA2130 bp
ProteinDPOGS209790-PA709 aa
Genomic positionDPSCF300117 - 935586-942330
RNAseq coverage993x (Rank: top 13%)
Annotation
HeliconiusHMEL0090040.065.97% 
BombyxBGIBMGA008015-TA0.058.79% 
DrosophilaCG40470-PA1e-16440.95% 
EBI UniRef50UniRef50_Q7PRZ00.050.00%AGAP000129-PA n=8 Tax=Neoptera RepID=Q7PRZ0_ANOGA
NCBI RefSeqXP_001861871.10.049.93%protease m1 zinc metalloprotease [Culex quinquefasciatus]
NCBI nr blastpgi|1700516710.049.93%protease m1 zinc metalloprotease [Culex quinquefasciatus]
NCBI nr blastxgi|1700516710.049.93%protease m1 zinc metalloprotease [Culex quinquefasciatus]
Group
Gene OntologyGO:00065083.4e-255proteolysis
GO:00082373.7e-37metallopeptidase activity
GO:00082703.7e-37zinc ion binding
KEGG pathwaynvi:1001242864e-74 
 K11140 (ANPEP)maps-> Glutathione metabolism
    Renin-angiotensin system
    Hematopoietic cell lineage
InterPro domain[1-701] IPR0019303.4e-255Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
[1-202] IPR0147823.7e-37Peptidase M1, membrane alanine aminopeptidase, N-terminal
Orthology groupMCL16494 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209790-TA
ATGTTCCCATGTTTTGATGAACCTGGATACAAAACTCCCTTTGAACTGAGCGTCGTGCGACCGAGGGATATGGTAGCACTTAGCAATGTTCCTGTCGCTAGGACAGAAGATATTAACGATGAACCAAACGCCGTCTGGGATCATTTCGAGAGGACTCCCCCAATGTCAACGTTCACGCTCGGCCTTGTCATCGCTGACCTCAAACAATTTGGCAGTGCCATACATTATGAAGACGAAAATGGAAACAATATTGAAATACGTGTTTGGGGTCGTCCAGAATTTGTAGAAATGTTAGAAGGTCTCAATGAGAAAGTGGCTCAAGTGTTTTCTGAAGTCGCAAACTTCTGGCAAGTTCCGCTACCATTACGCAGATTGGACATAGTGGCTTTACCAAACTATCAAGGGGTAAAGCCCGCCGATAATTGGGGTTTGATAGTTTTTAAGGAAAGCGATTTGTCCTCACGAGGCTACTTGCAGCTGTCCCAGGAGTTGTCCTACCAGTGGCTAGGCGCTCTCGTCTCTCCAGCTTGGTGGAGCGATGCTCATCTCAACAAAGCACTAGTTGGATACCTCGCTGCGGAGATTGCATTTAAAATTAACAATGGTTCAGAGATGGAAGGAAAATGGCCGATGACGGTTCTTTATTCTCTGTACTACGAGTTCAGTAAACGATATCCACACTCTCGGATCACCGGCATGAAACAAGAGACGGCTTGCACCAAAATAGAGTTATTGTTCCGAATGTTCAATTATACTATTGGTGGAGACACTTTCAGAAAAGGAATGAGGAAATTCATTGAGTCAAGGAAGTTTAAGACTTTTACTGGTGATGATATTTGGAATGCCCTCAACGAAGCCGCATTAGCAGATGGCAAGATTCCGAAAGATATTAATATTAAAACAGTAGCCACCAGTTGGATAGAAAAAGACAGACTTCCAGTCATTACAGTTAAGAGGAATTACGAAACCAATACGGCTTTTGTAACTCAGAAAGTGTATCTTCGCGAACGTCCCCACGATCTGCCTTCATCTAATAAGATGTTGTGGAGCGCTCCGCTGGTCGTGTGTCGTTCTGACAGACTCTCCTTCGAAGACTTCACGCCTTCCTCCTGGATCAGACACACAGACCTCAACCTGCTCAACATGCCGGATGACAAGCACTTCATCATCGTCAACCCTGAAGAAATTGGTAAGCGAAAATCTGTGAAGGGAGACGAAATCATCCACCGTGACGCATTTTCATCTCCGATCAAAAAATCGATTATAATAAAAACAAATACTGTCATTACTGAAATATCACACATGGAAAATTATATTTCAGACAGGTCGTATAAGTTTTGTTTGAGGGCGTACGTTCGCACGCTTTTGACACCTCTATATAACGAAATGGTAAGAGACGTTAAAGACGACGGAGATAACAGAAGAAAGAGTCTGTACTCGTTGACCAAAACCTTCTTATGTCAAGTTGGCTTCAAGCCTTGTATTATGGAGGCCCAAGAGCAGTTCAGTAATTGGATGCAGGCACCAAATCCTGATGAAGGAAATCCCATTTCTAATCAGTACATCTGTCCGGTATTCAAATGGGGAACGCAAAAGGAGTGGGATTTTGGACTACAAAGGGTTATAAACTTCCCACCATCGAGAAAACAAAGCGAAAGAACGTATCTCCTTAAAACATTAGCCGGTTGTCCAGTGGATGAGAAGAAAATAGAGAAATTGTTAAATATAACGATTCTAGAAGGCAACGGAAACTTTACGGAAACGGATTTATTCTTAATATTCAGTATGCTGACGGGAAATTCTCAAGGTTACACAACCTTATTCTATTTCCTCAACAACAATTGGGACGTGCTCAAAGAAAAATTCTCGAGTAAGACAAATATTTGGGATAACATGATAACGTCCGCGACCTCGCAGTTCACGACGAAGCCTGGTATGAATCTGGTGGCTATAATGTACGACAATCACAAAGGCGAATTCGGTTCAGCCGAGCATATTATAGAGAAATCGCTGAGAAACATTCGCGAGGAAGCTAAATGGTCAGAAGAAAACATTCCTGTGATAGAGACATGGCTAGACGACTACCTCTCAAGAACCGAGGTCAAAGACGACCAGGCGATCGACGCTTAA

Protein sequence:

>DPOGS209790-PA
MFPCFDEPGYKTPFELSVVRPRDMVALSNVPVARTEDINDEPNAVWDHFERTPPMSTFTLGLVIADLKQFGSAIHYEDENGNNIEIRVWGRPEFVEMLEGLNEKVAQVFSEVANFWQVPLPLRRLDIVALPNYQGVKPADNWGLIVFKESDLSSRGYLQLSQELSYQWLGALVSPAWWSDAHLNKALVGYLAAEIAFKINNGSEMEGKWPMTVLYSLYYEFSKRYPHSRITGMKQETACTKIELLFRMFNYTIGGDTFRKGMRKFIESRKFKTFTGDDIWNALNEAALADGKIPKDINIKTVATSWIEKDRLPVITVKRNYETNTAFVTQKVYLRERPHDLPSSNKMLWSAPLVVCRSDRLSFEDFTPSSWIRHTDLNLLNMPDDKHFIIVNPEEIGKRKSVKGDEIIHRDAFSSPIKKSIIIKTNTVITEISHMENYISDRSYKFCLRAYVRTLLTPLYNEMVRDVKDDGDNRRKSLYSLTKTFLCQVGFKPCIMEAQEQFSNWMQAPNPDEGNPISNQYICPVFKWGTQKEWDFGLQRVINFPPSRKQSERTYLLKTLAGCPVDEKKIEKLLNITILEGNGNFTETDLFLIFSMLTGNSQGYTTLFYFLNNNWDVLKEKFSSKTNIWDNMITSATSQFTTKPGMNLVAIMYDNHKGEFGSAEHIIEKSLRNIREEAKWSEENIPVIETWLDDYLSRTEVKDDQAIDA-