Monarch geneset OGS2.0

DPOGS212896
Transcript        DPOGS212896-TA  1182 bp
Protein           DPOGS212896-PA  393 aa
Genomic position  DPSCF300410 - 36302-39944
RNAseq coverage   53x (Rank: top 70%)
Annotation
Heliconius        HMEL016130              5e-177  74.01%
Bombyx            BGIBMGA003228-TA        0.0     79.90%
Drosophila        Ance-PB                 2e-78   38.17%
EBI UniRef50      UniRef50_UPI0002062793  7e-117  48.85%  UPI0002062793 related cluster n=1 Tax=unknown RepID=UPI0002062793
NCBI RefSeq       XP_974768.1             4e-127  53.06%  PREDICTED: similar to angiotensin-converting enzyme 8 (AGAP007982-PA) [Tribolium castaneum]
NCBI nr blastp    gi|91087059             8e-126  53.06%  PREDICTED: similar to angiotensin-converting enzyme 8 (AGAP007982-PA) [Tribolium castaneum]
NCBI nr blastx    gi|270009622            1e-123  53.06%  hypothetical protein TcasGA2_TC008905 [Tribolium castaneum]
Group
Gene Ontology     GO:0016020  1.5e-169  membrane
                  GO:0008241  1.5e-169  peptidyl-dipeptidase activity
                  GO:0008237  1.5e-169  metallopeptidase activity
                  GO:0006508  1.5e-169  proteolysis
KEGG pathway      tca:663638  1e-126
                  K01283 (E3.4.15.1, ACE) maps->
                      Chagas disease
                      Renin-angiotensin system
                      Hypertrophic cardiomyopathy (HCM)
InterPro domain   [3-393] IPR001548  1.5e-169  Peptidase M2, peptidyl-dipeptidase A
Orthology group   MCL17126  Insect specific

Nucleotide sequence:

>DPOGS212896-TA
ATGTCCCAATGGAAATTCGTCACTAACATCACGGAATACAACCGGCGTCGTATGCTTGAGGAGTTGGCGCTGAGCTCCAAATTCGAAAAGCTATCTTGGAGGAAGGCTGCTGCTTACGATGGATCCAGACTGTCTGATCCGCAGAACAGGAGACAGCTGAGCAGAATCATTCAGAACAGTAGGGCTGCCCTGTCTGACGACAAGTTTTCTGAGATACAACAGCTGATAACCGAAATGAAGGAGCTCTATAATTCAGCGAAAATATGCCCATATGGCCAAAAACCTTACGATCCTCATATACCAAAAGTTTTGCCAACATCCACCGAAACACCCGATAGTACCAAGGACCACCAAACGTACCAGCTCACGAACCAGCCATACCAGCACTACCAACCATACCAGGAGAACACAGACTACCCTAATTATTGTGACATGCAATTAGATCCAGAGATCACTAGGATTCTAGCTCATTCGAGGATTGAAAATGAATTGCTATATGTCTGGAAAAGTTTCAGAGATCAAACCGGACCGAAGCTAAAGAACAGGTTCATGAGATACGTGCAATTGGCCAACGAGGCTGCTGTCAAAACTGGGTTCAAGGACGCCGGAGACCAGATGCGAGCTGCGTATGAGGACCCGTCATTCAGAGCGAGTGTCGAAGAAATTTACAATCAAATTATACCTTTGTACAAACAACTATTTACTTACGTCCGACGAAAACTTCTGCTGAGATACGGAGACAAAAGTGTCCGTCCAGATGGGCCCATACCAGCACATTTATTGGGGAACATGTGGGCACAGAATTGGAAATCTATAATGGACCTGGTGATGCCGTTCCCTCAAGCTCCGAACGTGGACGTCACTTCTGAGATGCTGAGACAGGGATTCACTCCTCTCAGAATGTTCCAGATGGCAGAAGAATTCTATACATCAATGGGCCTTCGCCCCGTCCCTCCGGAGTTTTGGCGAGGGTCGTTACTCGCGCGACCGGCTGACCGGAGCGCCCAATGCACAGCTAGCGCCTGGGACTTCTGTAATAGGATTGATTATAGAATAAAGCAATGCACGGAGGTGACGATGCAAGATCTGATCTCCACTCATCATGAGATGGCTCACATACAGTACTACTTGCAGTACTCGGAACAACCGCAGCTGTTCAGGGATGGAGCGAATCCAGGTTAA

Protein sequence:

>DPOGS212896-PA
MSQWKFVTNITEYNRRRMLEELALSSKFEKLSWRKAAAYDGSRLSDPQNRRQLSRIIQNSRAALSDDKFSEIQQLITEMKELYNSAKICPYGQKPYDPHIPKVLPTSTETPDSTKDHQTYQLTNQPYQHYQPYQENTDYPNYCDMQLDPEITRILAHSRIENELLYVWKSFRDQTGPKLKNRFMRYVQLANEAAVKTGFKDAGDQMRAAYEDPSFRASVEEIYNQIIPLYKQLFTYVRRKLLLRYGDKSVRPDGPIPAHLLGNMWAQNWKSIMDLVMPFPQAPNVDVTSEMLRQGFTPLRMFQMAEEFYTSMGLRPVPPEFWRGSLLARPADRSAQCTASAWDFCNRIDYRIKQCTEVTMQDLISTHHEMAHIQYYLQYSEQPQLFRDGANPG-
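
The transcript length (1182 bp) and protein length (393 aa) listed above fit together as 393 codons plus one stop codon. The following minimal sketch shows how the nucleotide record could be checked against the protein record. It assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration and are not part of the geneset tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    The stop codon is written as `stop_symbol` (assumed here to be "-",
    matching the protein record above). Unknown codons become "X".
    """
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            protein.append(stop_symbol)
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of DPOGS212896-TA
print(translate("ATGTCCCAATGGAAATTCGTCACT"))  # MSQWKFVT
# Last three codons of DPOGS212896-TA, including the TAA stop
print(translate("CCAGGTTAA"))  # PG-
# Length check: 393 residues plus one stop codon
print(3 * (393 + 1))  # 1182
```

Passing the full DPOGS212896-TA sequence to `translate` and comparing the result with the DPOGS212896-PA record is one way to confirm that the two entries are consistent.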