Monarch geneset OGS2.0

DPOGS202239
Transcript: DPOGS202239-TA, 2367 bp
Protein: DPOGS202239-PA, 788 aa
Genomic position: DPSCF300032 - 909854-913921
RNAseq coverage: 78x (Rank: top 65%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL002108        0.0      60.58%
Bombyx          BGIBMGA004830-TA  4e-160   64.69%
Drosophila      CG8560-PA         6e-69    36.66%
EBI UniRef50    UniRef50_Q6H962   1e-166   65.03%    Carboxypeptidase n=4 Tax=Obtectomera RepID=Q6H962_HELAM
NCBI RefSeq     XP_002035702.1    5e-68    37.59%    GM13761 [Drosophila sechellia]
NCBI nr blastp  gi|49168687       4e-166   65.03%    carboxypeptidase precursor [Helicoverpa armigera]
NCBI nr blastx  gi|49168687       2e-161   66.59%    carboxypeptidase precursor [Helicoverpa armigera]
Group
Gene Ontology:
  GO:0006508  2.2e-98  proteolysis
  GO:0008270  2.2e-98  zinc ion binding
  GO:0004181  2.2e-98  metallocarboxypeptidase activity
  GO:0007186  1.2e-31  G-protein coupled receptor protein signaling pathway
  GO:0016021  1.2e-31  integral to membrane
KEGG pathway:
InterPro domain:
  [489-772]  IPR000834  2.2e-98  Peptidase M14, carboxypeptidase A
  [21-351]   IPR000276  1.2e-31  GPCR, rhodopsin-like, 7TM
  [385-476]  IPR009020  3.3e-06  Proteinase inhibitor, propeptide
Orthology group: MCL25069, Lepidoptera specific

Nucleotide sequence:

>DPOGS202239-TA
ATGATCATAGGTGTATCAATCATGTTCGGCGTTAGTATAGAAAATCCCATGAGCTGTAATCCACTCTGTATATTTCAAATCGGTATGATCGTATGTCCAGCCATGGTCTCCATATTCACGGTTGGCTTCATCGCTGTGGATCGCTACATTTATATACTCCATGGCTTGTATTACCAGAAATGGTTTACTACGACGAGAGTCAGAATTAGCATTCTTTGTATTTGGCTGATAGGCATAACTCTCTCATTTATCCCAGCGAGCGGGTGGATAAATACCGAACTAAGGCACTCGAGGTGTTATTACGTTACCCTTTTTCCTGGATCACTCATCTTACTTAATTCACTACTGAGTATTATCCCCATCGTACTTGTAGCTGTTCTGTATACTATTATTTTAATAAGAGCTTTGAGAAATGTAAGAGAAATAAAAACAACTATAAAAAACGTGGGAGCGAAATCAACCAACAATGAATTACGAATTCACATGGGAGGTGACAGAATTTTAGGGAAATATACTCAGTCAGTAAAATATCCGACCTCAAATAAATTTAAAATTCAAAGGACCGCTTCATTCAATGAAAATTACAACTATAACAGACCACGATCAGAAATCTTCAAGATTGATGAGAAATCTAGAAGTACAGTGAATTTAAACGAAAGAAATGATTTTCAAACAAGCAATTCCTCAGCTGATAATCAAAACAATAGAGATAGTTGTGAATCAGAATTTAGCATTTATTCCATCAGTTCAAGGATACCAGAGCAAACCGTCCACGTCATGTCATCGGAGGAGTCGCGTAACAAGAAAACTGAGGGCACAAAGCGAAAAGTGCAAAAAATGAAGGAACCAAACAAGTGGAGAGCAATCATAATTGTGATGCTGACGTCAGGAAGTTTTATATTTACTTGGATGCCGTTTTTCATAACTGTCATATTTTTTGTTTTCTGCGAGGAAAAACTTACTAATCCCAAGTGCATGCACCTCAGAATGATGCTGAGTGGACCGATAGCGACCCTAGCGTTCCTGAATAGCATCCTCAATCCCATGATATACGCCTGGGAGCCAAGTCAAGTCCGAAGGATTAGCCGGCTAGTTGTCAAAATGGCAAGGTGGATTACCTTCCTGCTATTCATCGCAGTTACCCACGCTCGACATGAGCAATACGAGGGGCACTCTCTCTACCGGGTAGCTGGTCCATCTGACCAATTTCAATACCTAGAAGCCTACATAGATTTCCTCTCTATTACACCAGCAGCCAAATCCACTTCCAGACAGCTAGAGGTTTTGATGAGACTATCATCAGAGGAAAAAGGCAAATGGCTGAAATATTTCGAAGACCATGACATGCCCTATTCTTTAGTATCAAACAACTTGGCACAGGTTCTTCGTGCTGAGGATTCTTATTTAATGAGGTCTAAAGAAGGCGAAGCTGAAGATACAAACAGTACAATGACATGGGATAGTTATTATAACGCGGAAGAGATCAACAAATACATAGACGAGATGGGCGCAAAATACCCTGACCTCATAACAGTTATCAACGCCGGCAGGAGTTACGAAGGTCGACAGATCAAATATGTCAGGATTTCCACCACACGCTTTGAAAACCTTCGCAAAAGAGTAATAGTTATAGACGCTGGTGTACATGCTAGAGAATGGGTTACCACACCAGTAGCTCTGTATTTGATCAAGCAATTGGCCGAAGGCGCTGATAAATTACTGACTGAAAACCTCGACTGGATCATTATACCTTTAGCAAACCCAGATGGTTACGAATACTCCATAAATGAGGATCGTTTATGGCGTAAAACCCGTTCTAAATCTCACGCTGGCTCAGACGCATGTCCTGGTGTTGATGGAAACCGAAACTTCGATTTCGACTGGGGCTCCAGACCTGACTCTAACATAGCCTGCTCCATTATTTACGAAGGACCATCACCCTTCTCGGAACCAGAGACACGTATCATAAGAGACGCTGTTTTGTCAAATTTGGCTCGTAC
TTCCCTTTACATTTCTCTACACAGCTATGGCAACATGTTCCTTTACGCTTGGGGAACTAACGGTACACTTCCTTCAAATGGCCTATCTCTTCACCTTGCCGGTATCATCATGGCAACAGCTATTGAGGAAGTCAAATTAGAAAAAGCTGATTCTTACATTGTCGGTAATGCTGCTAACGTTCTGTACTACACCAGCGGTACCTCAAGAGACTGGACTCGTGGCATGGGAATACCATTCACCTATACCATGGAACTTCCTGGTTATGAGTACGGCTTCCTCGTCCCACCCACTTACATCAAGCAAATAGTGACCGAATCTTTCGTAGGAATAGCTGCTGGAGCTCGTTACGTACTCTCACTATACTGA

Protein sequence:

>DPOGS202239-PA
MIIGVSIMFGVSIENPMSCNPLCIFQIGMIVCPAMVSIFTVGFIAVDRYIYILHGLYYQKWFTTTRVRISILCIWLIGITLSFIPASGWINTELRHSRCYYVTLFPGSLILLNSLLSIIPIVLVAVLYTIILIRALRNVREIKTTIKNVGAKSTNNELRIHMGGDRILGKYTQSVKYPTSNKFKIQRTASFNENYNYNRPRSEIFKIDEKSRSTVNLNERNDFQTSNSSADNQNNRDSCESEFSIYSISSRIPEQTVHVMSSEESRNKKTEGTKRKVQKMKEPNKWRAIIIVMLTSGSFIFTWMPFFITVIFFVFCEEKLTNPKCMHLRMMLSGPIATLAFLNSILNPMIYAWEPSQVRRISRLVVKMARWITFLLFIAVTHARHEQYEGHSLYRVAGPSDQFQYLEAYIDFLSITPAAKSTSRQLEVLMRLSSEEKGKWLKYFEDHDMPYSLVSNNLAQVLRAEDSYLMRSKEGEAEDTNSTMTWDSYYNAEEINKYIDEMGAKYPDLITVINAGRSYEGRQIKYVRISTTRFENLRKRVIVIDAGVHAREWVTTPVALYLIKQLAEGADKLLTENLDWIIIPLANPDGYEYSINEDRLWRKTRSKSHAGSDACPGVDGNRNFDFDWGSRPDSNIACSIIYEGPSPFSEPETRIIRDAVLSNLARTSLYISLHSYGNMFLYAWGTNGTLPSNGLSLHLAGIIMATAIEEVKLEKADSYIVGNAANVLYYTSGTSRDWTRGMGIPFTYTMELPGYEYGFLVPPTYIKQIVTESFVGIAAGARYVLSLY-