Monarch geneset OGS2.0

DPOGS201286
TranscriptDPOGS201286-TA2112 bp
ProteinDPOGS201286-PA703 aa
Genomic positionDPSCF300176 - 566735-572837
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0124012e-10564.47% 
BombyxBGIBMGA003037-TA0.072.51% 
DrosophilaCG31019-PA1e-13245.63% 
EBI UniRef50UniRef50_A7UVJ61e-13348.13%AGAP001814-PA n=1 Tax=Anopheles gambiae RepID=A7UVJ6_ANOGA
NCBI RefSeqXP_002001182.16e-13748.06%GI22112 [Drosophila mojavensis]
NCBI nr blastpgi|1951132531e-13548.06%GI22112 [Drosophila mojavensis]
NCBI nr blastxgi|3838650322e-13249.09%PREDICTED: cytosolic carboxypeptidase 6-like [Megachile rotundata]
Group
Gene OntologyGO:00065088.9e-20proteolysis
GO:00082708.9e-20zinc ion binding
GO:00041818.9e-20metallocarboxypeptidase activity
KEGG pathway 
InterPro domain[186-349] IPR0008348.9e-20Peptidase M14, carboxypeptidase A
Orthology groupMCL14275 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201286-TA
ATGGAAGCCTTGGTGGAAGATCTTGAGCATGCCAGTATATATCGCCGACGTCCATGTGACAGCGAAGAAAGTGACGGGGAAGGCGGCTTGGGCAACCTTAACCGACTAATCGTCCGGCCGCCGGGGCACAGTGGGAAAGCAAAACGCGGTCATCTCTGTTTCGACGCGGCCTTCGAATGCGGTAACCTCGGCCGAGCTGATCACATTACGGAGCTGGAGTATGACCTGTTCGTCCGACCCGACACTTGCAGGCCTCGATCGAGGTTCTGGTTCAATTTTACCGTCGAAAACGTTAAACAGGAACAACGAGTACTTTTCAATATAGTCAATATGGGTAAAGAATATACACTATATAATGAAGAAATGACACCTATAGTACGATCAACATCTCGTCCGAAATGGCAACGCATACCACGGCGGCTTCTATTCTATCATAAGTCTGTGATACATCGTGGTCGTAAAATACTAAGCCTAGCATTTGCTTTTGATAAGGAGGAAGACGTCTATCAATTTGCAGCTGCCGTTCCATATTCTTATTCAAAATTGCAAAAGTATTTAGCTATATGGGAAAAAAGAGCCCAAGGTTTTGCCACCAGACGATCTATTGCGCAGACTACGCAAAAACGGAGGATCGATCATCTCACTATTGGAGATACTGTTATTGGAGCAGAGACTAAAGACGATCTGAGTACAAAAGGTAAAGGTCAAGAAATAAAGAAGAGGGTAGTTTTAATTTTAGCACGAACGCACGGCGGCGAACCACCATCTTCTATTATATGCCAAGGTTTTCTAGACTACATTATAGGTTCATCGGAAAAAGCATTGTCGTTACGGAATGGCATCCGCATTGAGGTGGTTCCAATGCTCAACCCTGATGGTGTATTTTTGGGGAACCAGAGGTCGGATCTGTTAGGTTCAGATCTTAATCGGTGTTGGAACCGAGCTACAACTTTCGCACATCCAGCTCTTGTTGCTGTAAATGATCTCATTCAAAAGATAAATGCTGAAAAGACCCTCCAATTAGACTTCATAATCGATATTCATGCTGATCTTAGTCATGAGGGAGTATTTGTCCGTGGCAATTCGTATGATGATGTTTACAGATTTGAACGTCACGCAGTCCTACCTAAGTTCTTGGCTATGCGAGTCGAAGCCTGGAAGCCGGAGTCATGTCTTTACAACACTGACTCTATCAGCGCGGGCAGTGCGCGTCGTACTTTACCGACGGGTACTGTGGATGCATACAGCCTCCTAGCATCACTTGGTGGAAGACGATTAACACCCAGGGGTCCATACCTACACTACACTGAAGACGCATACTCTAAAATAGGCAGATCCTTAGCGAAGGCTCTGTGTGATTACTACAGGCACATCGGGGTTATTCCACCACAGGAAGGAATACCAAATAAAAAGAAAGATCCCAGAAAACGAAGAAGAAAACGAAGAGCAGCATATGAGAGAGAAAGATCTCCCAGTTCATCTCCTGATCGTCGTGTGTTGGTATCCCGTGTCCTGAGCCCCCCTCTGCCTGTGACGTCACCCCCTGCAATTCGAGCACTTCCCCCAAGAACGGCTCGTAGCGTTCGCCAAAGAACGTGTACCCGAGTACATATTGGAGATAATCTGTTGCCCGCTATAACTGACTCTAAAATAGGCAGATCCTTAGCGAAGGCTCTGTGTGATTACTACAGGCATATCGGGGTTATTCCACCACAGGAAGGAACACCAAATAAAAAGAAAGATCCCAGAAAGCGAAGAAGAAAACGAAGAGCAGCATATGAGAGAGAAAGATCTCCCAGTTCATCTCCTGATCGTCGTGTGTTGGTATCCCGTGTCCTGAGCCCCCCTCTGCCTGTGACATCACCCCCTGCAATTCGAGCACTTCCCCCAAGAACGGCTCGTAGCGTTCGCCAAAGGACGTGTACCAGAGTACATATTGGAGATAATCTGTTGCCCGCTATAACTGGTAAAGCAGTTTGTATGCCGAAGCTGACAGTGGTAGACCTCAGCGCATACATTCGTACCCCACCTCATCCCATCCGTCAGAGGGTCCCTACGTCACGTCCGCCACGTGTCATGACTAGTGACGAATACGACACACTGACTGACTGA

Protein sequence:

>DPOGS201286-PA
MEALVEDLEHASIYRRRPCDSEESDGEGGLGNLNRLIVRPPGHSGKAKRGHLCFDAAFECGNLGRADHITELEYDLFVRPDTCRPRSRFWFNFTVENVKQEQRVLFNIVNMGKEYTLYNEEMTPIVRSTSRPKWQRIPRRLLFYHKSVIHRGRKILSLAFAFDKEEDVYQFAAAVPYSYSKLQKYLAIWEKRAQGFATRRSIAQTTQKRRIDHLTIGDTVIGAETKDDLSTKGKGQEIKKRVVLILARTHGGEPPSSIICQGFLDYIIGSSEKALSLRNGIRIEVVPMLNPDGVFLGNQRSDLLGSDLNRCWNRATTFAHPALVAVNDLIQKINAEKTLQLDFIIDIHADLSHEGVFVRGNSYDDVYRFERHAVLPKFLAMRVEAWKPESCLYNTDSISAGSARRTLPTGTVDAYSLLASLGGRRLTPRGPYLHYTEDAYSKIGRSLAKALCDYYRHIGVIPPQEGIPNKKKDPRKRRRKRRAAYERERSPSSSPDRRVLVSRVLSPPLPVTSPPAIRALPPRTARSVRQRTCTRVHIGDNLLPAITDSKIGRSLAKALCDYYRHIGVIPPQEGTPNKKKDPRKRRRKRRAAYERERSPSSSPDRRVLVSRVLSPPLPVTSPPAIRALPPRTARSVRQRTCTRVHIGDNLLPAITGKAVCMPKLTVVDLSAYIRTPPHPIRQRVPTSRPPRVMTSDEYDTLTD-