Monarch geneset OGS2.0

DPOGS212730
Transcript: DPOGS212730-TA (2457 bp)
Protein: DPOGS212730-PA (818 aa)
Genomic position: DPSCF300012 - 272395-281702
RNAseq coverage: 87x (Rank: top 63%)
Annotation
Heliconius: HMEL013952 (E-value 0.0, 54.42%)
Bombyx: BGIBMGA013276-TA (E-value 9e-112, 47.86%)
Drosophila: CG8560-PA (E-value 7e-64, 35.85%)
EBI UniRef50: UniRef50_E3WTD2 (E-value 4e-89, 33.48%) Putative uncharacterized protein n=2 Tax=Anopheles RepID=E3WTD2_ANODA
NCBI RefSeq: XP_316270.4 (E-value 9e-65, 35.20%) AGAP006206-PB [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|312381280 (E-value 2e-88, 33.48%) hypothetical protein AND_06451 [Anopheles darlingi]
NCBI nr blastx: gi|46198282 (E-value 7e-88, 41.55%) midgut carboxypeptidase A2, partial [Trichoplusia ni]
Group
Gene Ontology:
  GO:0006508 (E-value 1.6e-90) proteolysis
  GO:0008270 (E-value 1.6e-90) zinc ion binding
  GO:0004181 (E-value 1.6e-90) metallocarboxypeptidase activity
  GO:0004180 (E-value 2.7e-11) carboxypeptidase activity
KEGG pathway:
InterPro domain:
  [130-415] IPR000834 (E-value 1.6e-90) Peptidase M14, carboxypeptidase A
  [28-119] IPR009020 (E-value 7.5e-14) Proteinase inhibitor, propeptide
  [30-110] IPR003146 (E-value 2.7e-11) Proteinase inhibitor, carboxypeptidase propeptide
Orthology group: MCL20960, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212730-TA
ATGTGGAGGTGTAACGGACACGACATGAGGGCCTTAGTATGTTTGTTCCTTGGTCTTGTGGCATCGACCTTAGCTGGGAACCATGACAAATATTCCGGGTATACCGTCTATGGGGTTCATATAGATGATCACTACCACCAGAAGGTACTTTCGGTTTTACAGAGGGATATCGATTTAGACGTTTGGCGACATGGGGTTCCGAAAGCCAGAGATGCTCTGGTTATGGTTTCGCCAGAAAATAGACTGCAATTCCTAAAGATTTTGGAAGAAAATAACATGCACCATTACATTCATCTCCATGATGTTGCGAGATCTCTGAAACAAAGTGATGATGATTTTTTACGGTGGAAACTGAGCAGAGGAAATGAATTGTCTATTTTTGAGGATTATCCAACATACGGTGATGTCATCGACTACATGGAGAGAATAGCTCGTAATAATTCAGCCATAGCGACTTTAGTGAACGCCGGCAATAGTTTCGAGGGACGACCTGTGAAATACTTGAAGATATCAACCACAAACTTTACTGACACCAGTAAACCTATCTACTTCCTGGAGGCAACGATGCATGCTCGTGAGTGGGTGACAACCCAAACAGCTCTGTACACCATACATCGATTGATCGAAGACCTGAAGACTGAGGATAGGGATCTGATAGAAGGGATCGATTGGATCATATTTCCCGTGGTCAATCCTGATGGATATGAATTTTCTCATACAACCGATCGCATGTGGAGGAAAACACGTTCTTTCAATGCAACGATCAGCGCCACGTGCTACGGTGTCGATCCAAACAGGAACTTTGACATTGAATTTAATACAGTAAGCGTGTCTCCGGATCCCTGTTCCCAAATATATCCGGGACACGAAGCTTTCTCGGAACCAGAAACCCGTTACGTCAGAGATATTTTATTGGAATACAATAATCGCATCCAAGTGTTCATGGACGTGCATAGTTACGGCAACTACATTGTTTACGGCTTCGATAACGGAACGCTGCCACAGAACGCGCTGCACATACATCACGTGGGTGCGTTAATGGGTGCGGCTATCGATACTCTTAAATTACAAAAAGCACCCTTCTACATTGTCGGCAACTCGAGGTACGTCTTCTACGCTGTCTCTGGAAGTGGTCAGGATTACGCACAGGCGGTTGGCGTAGGATTCTCGTACACTATAGAGTTGCCAGGATACGAGTATGACTTCCGAGTTCCTCCCTCGTACATCAATCAAATCAACACGGAGACCTGGGAAGGTGTCGCAGCCTCGGCTCGAGCTGCAAGATCTTACATTGTGTATGGTGTTAAGTTAGAAGATCAGGTGGACCAAGAGGTACTTTATGGGCTACAATCAGAACTGGATTTGGATTTATGGGAGTACGGAGTTCCCAAAGTTAAAGATGTTCTCGTCATGGTGTCACCGGACAAGAAAGAGAGATTTTTGGACATTTTAGATAGAAATTATATCAAACACTACCTTCACCTATCTGACGTGGCCCAGGCTTTGGAAGAGAACGACAACGATTTGTCTAGTTGGGAACGTGAATCAAGCAGAGTTTTTGAAAAGTATGCAAGATATGCTGAGATAGATGCATACCTTGAAGAGGTAGCACAAGCTCATCCCCGGATTGTGACGCTCGTGAATGCTGGACTTAGTTTCGAAGGACGTCCTATCAAATATTTAAAGATATCCACTTCAAACTTCACCGACCCCAGCAAGCCTGTCTACTTCATCGACGCCGCGATGCACGCTCGCGAGTGGATTACGATTCCACCAGCTCTGTACAGCATTCATCGTCTGGTGGAAGACCTTCGAGAACAAGACCGAGACTTACTGGAAGAAATCGACTGGATTGTGATGCCGCTGGAACGCTTATGGCGGAAGACGCGTTCCTTCAATGTCACAAGACATCCTGAATGTTACGGAGTGGATGCGAATCGAAACTTCGACGTAGACTTCTATGGCACCGGCTCCAGTACCAATCCCTGCGTGAACACATT
CCGTGGTCACGAACCATTCTCAGAGCCGGAAACCCGCTGTGTTCGAGACGTCATTCTAGAACACATAGATCGCTTGCAAGTGTACCTTAATGTACACAGTCATGGTAACCTCATCTTGTATGGTTACGGCAATAAAACCTTACCCTCCAATGTTGTCCAACTACATCAAGTCGGTGCCATCATGGGAGCAGCTATAGATCATAAAAAACTCCTCGAGGCTCCGTATTATCTGGTGGGAAATAGTGCGCTTGTACTGTACACAAGCTCCGGCAGCGCACAGGATTATGGACAGGTGGTCGGTGTACCCTTCTCCTATACACTGGAGTTGCCTGGAATGGGTTATGGGTTCCAGATTCCCGTCAGGTTCGTCAACCAAGTCAATATGGAAACCTGGGAAGGCATTGCTGCATCAGCACGAATCGCTAAAATATATTATAGAGCGAGAGATCAAAAGTAA

Protein sequence:

>DPOGS212730-PA
MWRCNGHDMRALVCLFLGLVASTLAGNHDKYSGYTVYGVHIDDHYHQKVLSVLQRDIDLDVWRHGVPKARDALVMVSPENRLQFLKILEENNMHHYIHLHDVARSLKQSDDDFLRWKLSRGNELSIFEDYPTYGDVIDYMERIARNNSAIATLVNAGNSFEGRPVKYLKISTTNFTDTSKPIYFLEATMHAREWVTTQTALYTIHRLIEDLKTEDRDLIEGIDWIIFPVVNPDGYEFSHTTDRMWRKTRSFNATISATCYGVDPNRNFDIEFNTVSVSPDPCSQIYPGHEAFSEPETRYVRDILLEYNNRIQVFMDVHSYGNYIVYGFDNGTLPQNALHIHHVGALMGAAIDTLKLQKAPFYIVGNSRYVFYAVSGSGQDYAQAVGVGFSYTIELPGYEYDFRVPPSYINQINTETWEGVAASARAARSYIVYGVKLEDQVDQEVLYGLQSELDLDLWEYGVPKVKDVLVMVSPDKKERFLDILDRNYIKHYLHLSDVAQALEENDNDLSSWERESSRVFEKYARYAEIDAYLEEVAQAHPRIVTLVNAGLSFEGRPIKYLKISTSNFTDPSKPVYFIDAAMHAREWITIPPALYSIHRLVEDLREQDRDLLEEIDWIVMPLERLWRKTRSFNVTRHPECYGVDANRNFDVDFYGTGSSTNPCVNTFRGHEPFSEPETRCVRDVILEHIDRLQVYLNVHSHGNLILYGYGNKTLPSNVVQLHQVGAIMGAAIDHKKLLEAPYYLVGNSALVLYTSSGSAQDYGQVVGVPFSYTLELPGMGYGFQIPVRFVNQVNMETWEGIAASARIAKIYYRARDQK-
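
Consistency check (illustrative sketch):

The transcript and protein records above can be cross-checked against each other. The Python sketch below is not part of the original record. It assumes the standard genetic code, and the names `translate` and `CODON_TABLE` are hypothetical helpers introduced only for this illustration. It translates the first nine and last five codons copied from DPOGS212730-TA and confirms that an 818 aa protein plus one stop codon accounts for the stated 2457 bp.

```python
from itertools import product

# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 27 and last 15 nucleotides copied from the DPOGS212730-TA record.
first_codons = "ATGTGGAGGTGTAACGGACACGACATG"
last_codons = "AGAGATCAAAAGTAA"

print(translate(first_codons))  # MWRCNGHDM
print(translate(last_codons))   # RDQK*
print((818 + 1) * 3)            # 2457
```

The translated fragments match the start (MWRCNGHDM) and end (RDQK) of DPOGS212730-PA. The record writes the terminal stop as "-" where this sketch prints "*".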