Monarch geneset OGS2.0

DPOGS205560
TranscriptDPOGS205560-TA1974 bp
ProteinDPOGS205560-PA657 aa
Genomic positionDPSCF300099 - 30910-48466
RNAseq coverage402x (Rank: top 30%)
Annotation
HeliconiusHMEL0210240.068.98% 
BombyxBGIBMGA002526-TA7e-12844.35% 
DrosophilaAnce-3-PB0.051.68% 
EBI UniRef50UniRef50_E3WR650.059.90%Putative uncharacterized protein n=2 Tax=Endopterygota RepID=E3WR65_ANODA
NCBI RefSeqXP_318843.30.058.47%angiotensin-converting enzyme 7 (AGAP009757-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3123823370.059.90%hypothetical protein AND_05023 [Anopheles darlingi]
NCBI nr blastxgi|3123823370.059.90%hypothetical protein AND_05023 [Anopheles darlingi]
Group
Gene OntologyGO:00160203.6e-252membrane
GO:00082413.6e-252peptidyl-dipeptidase activity
GO:00082373.6e-252metallopeptidase activity
GO:00065083.6e-252proteolysis
KEGG pathwayaga:AgaP_AGAP0097570.0 
 K01283 (E3.4.15.1, ACE)maps-> Chagas disease
    Renin-angiotensin system
    Hypertrophic cardiomyopathy (HCM)
InterPro domain[2-657] IPR0015483.6e-252Peptidase M2, peptidyl-dipeptidase A
Orthology groupMCL16832 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205560-TA
ATGGCCTGTGCTATAGGACGGATACTTATAAGAACTCTATTTCAATTTACAATAGTTCATATTTTATATGCTGACCCTCAGCTGGACCTTCCCCAACTGCCCCAAGTGTCTTCACAGCGTACTCCTTATGGCTATGGATTTTCCAGTACACAGTCCAATTTACCCACAAGCCAGTTTGATTTAAACCGTAATTTTCAATATAGTACCGCCAGAGCACCGAGTGCTGTCACTCCGGGTGTGTATCCTGTATCTTCTACCACGCATCAGATAAACCCGGACATAAATAATAACGGATATCCCAGTTTAGATTACGGGAACGGACGGAATCCAAGTTCCACTCTTCGTCCATTCGATGATAGAAATCGTGATGATAGAATAGACATCACGAATGACGCGAATTTTCGTCGGAATGATCCAAATTTCGCTAGAAACGATCCCAGCTATGTTGGGACCAACCCGTATAATTTCAACGGTGATGTGAACTCTAATCGTATAGATGAAAGGCTTAATCACGCCAGCCTGCAGCAAATTAAGGATTTTCTTCACCAAGCTGATGCACAGGCTTCTAAGGAGTGTACGAACAACGTAGCAGCTCAGTGGAATTTTGAAACCGATGTTAATGATGCAACTCAGCATGCTGCTTTGGAAGCACAACAACGCTACACGTTATTCCAGCGAGGGTTGTGGGAAGCGGCTCAGGGCCTGCCTCGTGGTGCTATCAGGGATTTCTCTACCTTCAGACAACTGAAGTTACTATCTACCATCGGTCCGGCAGCTCTGCCACCAGATCAATTGGACAGGTATAACAGAATCATTAACGATATGCTGGCTGTCTACAATTCAGCGGAGATCTGTGCCTATAATGAACCCTTCAAGTGCGGGCTTCACCTCCAGCCAGACCTACAGTTCACCATGTCACATTCCAGAGACTGGGATGAGTTACAACATGTCTGGACAGAATGGAGGAGGAATACTGGAAGACGGATTAGAGATTTGTACGAACAACTCGTTGATCTCACCAATCAAGCAGCAAGGTTGAATAATTTCACTGATGCTTCTGCTTATTGGATGTTTCCATACGAAACCTTCAACATGAGACAAGAAGTGGACGAAGTTTGGGAACAGGTCAAGCCGCTTTATGATGTACTACATGCATATGTCCGTCGTCGTCTTCGTGAAGCGTATGGACCTGAAAGAATCTCACGATCCGCACCTATCCCAGCTCATGTACTTGGTGATATGTGGGGGCAGAGCTGGTCTGGGATAGTCCCCTTCACTCTACCATACCCCGGGAAAAAACTCGTCGATGTCACTCCCGAAATGGTGCAACAGGGTTACACACCCCTTACGATTTTCCAACTGGCGGAGGAGTTCTACGTTTCCATGAACATGTCTGCGATGCCTCCAGACTTCTGGGCACTGAGCGTGTTTGAGCAGCCTGCTGACCGACACGTGCACTGTCAACCGTCTGCTTGGGACTTTTGTAATGGACACGATTACAGAATAAAGATGTGCACTCATCCAGATCAGAAAGATTTGATAACGGCTCACCACGAGATGGCACACATTGAATATTTCCTGTCATACAGAAATCAACCGAAGGTCTTCCGCGACGGAGCCAACCCAGGATTCCACGAAGCGATCGGCGAGGCAATCGCGCTTTCCGTGTCATCTCCCCGCCACCTCCAAACCCTGGGTCTCATCCAGAAGTCTGTAGATGACACGGCCCACGACATCAATTATCTCTTCACACAGGCGATGGATAAACTGGCTTTCCTTCCATTCGCCCTGGTGATGGATAAATGGCGCTGGGATGTCTTCACAGGCGACGTTAGGAAAGAGCAGTACAATTGTCATTGGTGGAGATTAAGAGAACAGTACGAGGGCATTAAGCCGCCAGTGCTACGTTCTGAATTGGACTTCGATCCCGGCTCCAAATATCACATACCAGCAAACATTCCCTATATAAGGTGA

Protein sequence:

>DPOGS205560-PA
MACAIGRILIRTLFQFTIVHILYADPQLDLPQLPQVSSQRTPYGYGFSSTQSNLPTSQFDLNRNFQYSTARAPSAVTPGVYPVSSTTHQINPDINNNGYPSLDYGNGRNPSSTLRPFDDRNRDDRIDITNDANFRRNDPNFARNDPSYVGTNPYNFNGDVNSNRIDERLNHASLQQIKDFLHQADAQASKECTNNVAAQWNFETDVNDATQHAALEAQQRYTLFQRGLWEAAQGLPRGAIRDFSTFRQLKLLSTIGPAALPPDQLDRYNRIINDMLAVYNSAEICAYNEPFKCGLHLQPDLQFTMSHSRDWDELQHVWTEWRRNTGRRIRDLYEQLVDLTNQAARLNNFTDASAYWMFPYETFNMRQEVDEVWEQVKPLYDVLHAYVRRRLREAYGPERISRSAPIPAHVLGDMWGQSWSGIVPFTLPYPGKKLVDVTPEMVQQGYTPLTIFQLAEEFYVSMNMSAMPPDFWALSVFEQPADRHVHCQPSAWDFCNGHDYRIKMCTHPDQKDLITAHHEMAHIEYFLSYRNQPKVFRDGANPGFHEAIGEAIALSVSSPRHLQTLGLIQKSVDDTAHDINYLFTQAMDKLAFLPFALVMDKWRWDVFTGDVRKEQYNCHWWRLREQYEGIKPPVLRSELDFDPGSKYHIPANIPYIR-