Monarch geneset OGS2.0

DPOGS204378
TranscriptDPOGS204378-TA969 bp
ProteinDPOGS204378-PA322 aa
Genomic positionDPSCF300002 - 1751150-1756730
RNAseq coverage2046x (Rank: top 6%)
Annotation
HeliconiusHMEL0130746e-10057.01% 
BombyxBGIBMGA007678-TA8e-3137.13% 
Drosophilaobst-E-PB1e-4141.89% 
EBI UniRef50UniRef50_F4WJU23e-5847.30%Chondroitin proteoglycan-2 n=8 Tax=Endopterygota RepID=F4WJU2_ACREC
NCBI RefSeqNP_001165852.14e-6047.35%cuticular protein analogous to peritrophins 3-E [Nasonia vitripennis]
NCBI nr blastpgi|3071682771e-5952.75%Chondroitin proteoglycan-2 [Camponotus floridanus]
NCBI nr blastxgi|3071682772e-6453.00%Chondroitin proteoglycan-2 [Camponotus floridanus]
Group
Gene OntologyGO:00080618.3e-15chitin binding
GO:00060308.3e-15chitin metabolic process
GO:00055768.3e-15extracellular region
KEGG pathwaycin:1001759715e-07 
 K05030 (CLCA4)maps-> Olfactory transduction
InterPro domain[143-214] IPR0025578.3e-15Chitin binding domain
Orthology groupMCL16130 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204378-TA
ATGAGGGGTCTTATTATCTTGGTGACTTTATGCGGTGTTGTTTTCGGACGCGCTCAAATTTCTGGAGAAGAGAAAAATGTGCGAGAGGAATTAAATCCGGTAGAAATCATTGATGATAGTCAATTCACCAAAGAAAATGAAGCTACCACAGAAGCTTTGGTATTTGAACGGAGAGCAAAATACCGAGCTCTGCCTGACCAATACAACGAGGAATCCTCGCCAGTAGCGGTCAGCAAAACATGCAGGGAAAAGAACGAAAGGTACTCAATACCTGGAAGCTGCGACAGATATATTGAATGCCTGAACGGAACAGCCGAAGAGAAGACATGCCCTGATGGATTACGTTACAATCCGAACGTCAATTTCAATGTGTACCCCTGCCAGTACCCTATAGACGTACCTTGCTTGGAGAGATCGGCCGGATTGCAACCACCACAGCCTACAGAGGATTGTCCCCATCAATTCGGTTACTTCAAGATTGGTGATGCCAAAAACTGCAGCGGATTCAGGAACTGCGTGAACGGCGTTGCTTATGACTTCACCTGTCCTGACGGTCTTGCCTTCAGCTCTGAATCTTATCGCTGCGAATGGCCTGATGAATCCAAAGATTGTGATGCAGAAGCTTTCCTCGGCTTCCGTTGTCCCCCTGTGCCAGAATCTAGGGAACTAGGTGCTCCAGCAGGTTTCAGATTTTACAGATCTCCCAGCAACTGTCAAAATTACTTCCTCTGTATTAACGGCAAACCTCGTCGATTGAGCTGTGGTGGTTATTCAGCCTTTGACGAGTCATCGGAATCCTGTATATCTGCTGTAGACATACCAGAATGTCCTGCAGAACTGAGAGCCCGCGCTGCCCAAATCATTGAGGACGAAAAGCTGCGAAGCACAGCGGAAGCCGCCTTCGCTAAACTCAGATACGTCCAAAGCGAAGACGTCAAAGAATCTACGACGGTATCTTATGATGCATAA

Protein sequence:

>DPOGS204378-PA
MRGLIILVTLCGVVFGRAQISGEEKNVREELNPVEIIDDSQFTKENEATTEALVFERRAKYRALPDQYNEESSPVAVSKTCREKNERYSIPGSCDRYIECLNGTAEEKTCPDGLRYNPNVNFNVYPCQYPIDVPCLERSAGLQPPQPTEDCPHQFGYFKIGDAKNCSGFRNCVNGVAYDFTCPDGLAFSSESYRCEWPDESKDCDAEAFLGFRCPPVPESRELGAPAGFRFYRSPSNCQNYFLCINGKPRRLSCGGYSAFDESSESCISAVDIPECPAELRARAAQIIEDEKLRSTAEAAFAKLRYVQSEDVKESTTVSYDA-