Monarch geneset OGS2.0

DPOGS202623
Transcript: DPOGS202623-TA (2709 bp)
Protein: DPOGS202623-PA (902 aa)
Genomic position: DPSCF300140 + 433041-445637
RNAseq coverage: 931x (Rank: top 14%)
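The lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the transcript is coding sequence only (it begins with ATG and ends with a stop codon, with no UTRs) and that the genomic coordinates are 1-based and inclusive; both are assumptions. The function names are hypothetical.

```python
# Values copied from the record; everything else is illustrative.
TRANSCRIPT_BP = 2709
PROTEIN_AA = 902
GENOMIC_START, GENOMIC_END = 433041, 445637


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS that ends with exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


def genomic_span(start: int, end: int) -> int:
    """Span in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)
    print(genomic_span(GENOMIC_START, GENOMIC_END))
```

Under these assumptions, 2709 bp corresponds to 903 codons, that is 902 residues plus a stop, which agrees with the listed protein length. The genomic span is considerably longer than the transcript, which would be consistent with introns, although the record does not list the exon structure.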
Annotation
Database         Hit                E-value   Identity   Description
Heliconius       HMEL020267         1e-149    46.71%
Bombyx           BGIBMGA006539-TA   0.0       62.84%
Drosophila       Ance-3-PB          2e-69     29.89%
EBI UniRef50     UniRef50_D2A533    2e-88     33.21%     Putative uncharacterized protein GLEAN_15465 n=2 Tax=Tribolium castaneum RepID=D2A533_TRICA
NCBI RefSeq      XP_001659916.1     3e-90     34.01%     angiotensin-converting enzyme (dipeptidyl carboxypeptidase [Aedes aegypti]
NCBI nr blastp   gi|157122005       5e-89     34.01%     angiotensin-converting enzyme (dipeptidyl carboxypeptidase [Aedes aegypti]
NCBI nr blastx   gi|157122005       6e-88     34.01%     angiotensin-converting enzyme (dipeptidyl carboxypeptidase [Aedes aegypti]
Group
Gene Ontology     GO:0016020   3e-151   membrane
                  GO:0008241   3e-151   peptidyl-dipeptidase activity
                  GO:0008237   3e-151   metallopeptidase activity
                  GO:0006508   3e-151   proteolysis
KEGG pathway      aag:AaeL_AAEL009310   7e-90
                  K01283 (E3.4.15.1, ACE) maps-> Chagas disease
                                                 Renin-angiotensin system
                                                 Hypertrophic cardiomyopathy (HCM)
InterPro domain   [2-829] IPR001548   3e-151   Peptidase M2, peptidyl-dipeptidase A
Orthology group   MCL25254 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202623-TA
ATGATGCTGACACAAGCATTAATTTGCTTGGGAATCATGTCCATTGGGAAGGGCATGCAGTTTTCCAACAACCCGACCTCTGAAATGTCTGCAGATGAAAAAGGACTGGCGTTCAATAAGACAGAGCAATCAGATGAAAGTGGTTATGAAGTGGAAAGTGACGTTGAAGATAAAAAGTATCTCGATGAGGCCATGAAAACAGTTCTAGACCCAAATACAGTAATACTCAAAGGTGACGAGGTTGTTGAAAATTTTAAAAGTCAAATTGATAACTTAGAACATGGAAATGCATCAATGGAAACTTTTATGAAAGAAATGGATAAATTAAGTTTAGAGATTTGCAAAACATCACAGGAATCGTTATGGGATTATGTTACTGATATCAATAGTGAAACTAAGAAAACTAGAATGGTTAGAATCGCAGCTGAAGAAGATGAAATAAAAAAGCAATATTGGACAATATTGAAGACAAAATATTTAAATTCAGCCCACACATCAGATGACGCCCAGCTAGCAAGAAAAGTTAGAATTATTCGAGATCGTGGAACTAATGTCCTGATGCCCCAAAGTAAGCAACGTGAAGAAATTGACACAATGCAGCGAACATGGAGCCGAGTGGCCGTCTGCGCCTACAATATTAGTATTTGCAATGCTGATGATTCGATGCGTACAATGAGCGATATCATAACAATATTCAAAACAAGCAATGATACTAAAGAATTGTCATACTATTGGAGAGCCTATAGAGATGCAACTGGAAAAAAGATAAGACCGATTTTTAAAGATTATGTATTTAGGATGAACAAAGTCGCAATATCAGAAGATTTTAATGACGCCGGTGACATGTGGAGATATGCCTTTGATGACTCAACTTTAAAATTTAAACAAACCGTTGAAAGACTTTGGAACGAAATAAAACCTATATATAATCTTTTACATAGTTTTGTAAGAAAACGAATGGAAATTTATTACGAGGAATTAAAAGGTGACAACAACTTAATACCTGCGCATATTTTAGGTAACCTTTGGGCTCAAGAATGGCAAGCAATTTATCCCAAAGTAGCGCCCTATCCTGATATAGAACGGCCAACAGTTGTAAGCAACGAAACTGTCAATGCAAGGGACCTCTTTCATGCTGTAGATGAATTTCACCAATCACTTGGTTTCGAGTCAGCTCAGGATACCTTTCAGGATATAAAGGAGTCTATGTCCACAGTCAATTGTTTACCATCTAGCCATGACATGTGTGATGGTATTCATTATAAAATAAAATGGTGTGGTGAGAAAATTTCTGATGTCACTATAGGCTTATCAAGAGCAGCGAGATTATTGGGACATGTGCAATATTTTAAGCATTATCGTAATCTAGAGCCTTTGTATAGGGATGGGCCAAATCCAGCCTTTCACGATGCAGTTTCTGATATCGTAGCTGTGCAAATAGCCTCGCCAAATCACTTGAGCACCTTGAACTTCGTGAAGGTGGACACCACTGACAATTCCACTATAAACCACTTGCTTTGGTTGGCCTTAGAAAAATTTCCACTCATGGCATATGCTTACGTCCTTGATAAGTGGAGATGGGATGTGTTTTCTAACAGTAGCATGGAAAATTTAAATCAACATTGGTGGGATTTAAGAGTAAAAGAGACTGGCATCTCAGCGCCAGTTCTTCGTAATGAGAGTGACTTGGATCCTGCTGCTAAATATCATGTTGTCTCTCATGTCCAATATATAACGTATCTGATATCGCACATACTAGAGTTTCAAATATTATCATCTCTATGTAAAAAAGCGAATCACACAGGACCTCTGCACGAGTGTTCCATTTATGGAGTGAAGGAAGCGGGAAAACTCTTGAGGTATCCTGCTTATATTACCACCTTTCACGATGCAGTTTCTGATATCGTAGCTGTGCAAATAGCCTCGCCAAATCACTTGAGCACCTTGAACTTCGTGAAGGTGGACACCACTGACAATTCCACTATAAACCACTTGCTTTG
GTTGGCCTTAGAAAAATTTCCACTCATGGCATATGCTTACGTCCTTGATAAGTGGAGATGGGATGTGTTTTCTAACAGTAGCATGGAAAATTTAAATCAACATTGGTGGGATTTAAGAGTAAAAGAGACTGGCATCTCAGCGCCAGTTCTTCGTAATGAGAGTGACTTGGATCCTGCTGCTAAATATCATGTTGTCTCTCATGTCCAATATATAACGTATCTGATATCGCACATACTAGAGTTTCAAATATTATCATCTCTATGTAAAAAAGCGAATCACACAGGACCTCTGCACGAGTGTTCCATTTATGGAGTGAAGGAAGCGGGAAAACTCTTGAGTGACGGAATGTCATTGGGTGCAAGCAAGGATTGGAGTATCGTGTTAAAGACTATGACAGGAGAAACAGAATTATCTACAAGTGGTATTTTAGACTACTTTTCGCCTTTGAAGGAGTTTTTAAACGAGGAGATTAAAAAATTGGAAATACGTAATCATGAGGTAGACAGTAATGCTCCCTTTGTAGTTGGCATTATCGTTGTTATTTTGATTATATTTATGTTCGTGCTGTATTGTTTCAAAAAAAGAGATAAAGTAAGACAATTATTGTCTCTATGCGGTTTAAGTAAAAATGGTTCTTTAGATATAGCTACACAAGAAATACCGAGACGGAAACCTGAAGCTGTGGTTGAAAGGGAAGAAAAAGTTTAA

Protein sequence:

>DPOGS202623-PA
MMLTQALICLGIMSIGKGMQFSNNPTSEMSADEKGLAFNKTEQSDESGYEVESDVEDKKYLDEAMKTVLDPNTVILKGDEVVENFKSQIDNLEHGNASMETFMKEMDKLSLEICKTSQESLWDYVTDINSETKKTRMVRIAAEEDEIKKQYWTILKTKYLNSAHTSDDAQLARKVRIIRDRGTNVLMPQSKQREEIDTMQRTWSRVAVCAYNISICNADDSMRTMSDIITIFKTSNDTKELSYYWRAYRDATGKKIRPIFKDYVFRMNKVAISEDFNDAGDMWRYAFDDSTLKFKQTVERLWNEIKPIYNLLHSFVRKRMEIYYEELKGDNNLIPAHILGNLWAQEWQAIYPKVAPYPDIERPTVVSNETVNARDLFHAVDEFHQSLGFESAQDTFQDIKESMSTVNCLPSSHDMCDGIHYKIKWCGEKISDVTIGLSRAARLLGHVQYFKHYRNLEPLYRDGPNPAFHDAVSDIVAVQIASPNHLSTLNFVKVDTTDNSTINHLLWLALEKFPLMAYAYVLDKWRWDVFSNSSMENLNQHWWDLRVKETGISAPVLRNESDLDPAAKYHVVSHVQYITYLISHILEFQILSSLCKKANHTGPLHECSIYGVKEAGKLLRYPAYITTFHDAVSDIVAVQIASPNHLSTLNFVKVDTTDNSTINHLLWLALEKFPLMAYAYVLDKWRWDVFSNSSMENLNQHWWDLRVKETGISAPVLRNESDLDPAAKYHVVSHVQYITYLISHILEFQILSSLCKKANHTGPLHECSIYGVKEAGKLLSDGMSLGASKDWSIVLKTMTGETELSTSGILDYFSPLKEFLNEEIKKLEIRNHEVDSNAPFVVGIIVVILIIFMFVLYCFKKRDKVRQLLSLCGLSKNGSLDIATQEIPRRKPEAVVEREEKV-
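The relationship between the two sequences above can be illustrated with a short translation routine. This is a minimal sketch, not the tool used to produce the record. It assumes the standard genetic code and writes the stop codon as "-", matching the final character of the protein sequence. The function name `translate` is hypothetical.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1; stop codons become stop_symbol."""
    cds = cds.upper().replace("\n", "")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First six and last four codons of DPOGS202623-TA, copied from the record.
    print(translate("ATGATGCTGACACAAGCA"))
    print(translate("GAAAAAGTTTAA"))
```

The first six codons of the nucleotide sequence translate to MMLTQA and the last four to EKV-, matching the start and end of DPOGS202623-PA. To check the whole entry, join the wrapped FASTA lines into a single string before calling `translate`.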