Monarch geneset OGS2.0

DPOGS209817
TranscriptDPOGS209817-TA978 bp
ProteinDPOGS209817-PA325 aa
Genomic positionDPSCF300117 + 222974-225931
RNAseq coverage53x (Rank: top 70%)
Annotation
HeliconiusHMEL0167634e-3130.97% 
BombyxBGIBMGA008043-TA2e-13370.07% 
DrosophilaCAH2-PA3e-2728.62% 
EBI UniRef50UniRef50_E0VS501e-5338.31%Major antigen, putative n=1 Tax=Pediculus humanus corporis RepID=E0VS50_PEDHC
NCBI RefSeqXP_002428944.12e-5438.31%major antigen, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420169234e-5338.31%major antigen, putative [Pediculus humanus corporis]
NCBI nr blastxgi|3071758733e-5239.00%Carbonic anhydrase 13 [Camponotus floridanus]
Group
KEGG pathwayphu:Phum_PHUM2598102e-31 
 K01672 (E4.2.1.1)maps-> Nitrogen metabolism
InterPro domain[28-274] IPR0011481.2e-50Carbonic anhydrase, alpha-class, catalytic domain
[46-273] IPR0235612.3e-46Carbonic anhydrase, alpha-class
[46-273] IPR0183412.3e-46Carbonic anhydrase-related, insect
Orthology groupMCL17385 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209817-TA
ATGTACAAAAAAGCAGCGCCAGCACAAACAGAGGAGGAAGAACAACATCCTAAAGAGGAGGAAATTGTTATTCCGGATATAGGGACTTTGGAATGGATATATTACTTGAGTGAATTAGAAGGTGACCTTCCAACTCCCATAGATGTGTCAATCACGGGTTCTCTAAAGTACCCGTGCCCAGATCTCGTCTGGTACAACTTCGAAATATACCCTCACAAAGTCAAAATAACAAACACCGGTCACACAGTCCTGCTTGGAGCGAAATGGAGAACCGAAAGACCTTATCTAAAAGGTGGACCCCTTTTAGAAAAACACATATTTTCTCAAGTACATTTTCATTGGGGTGCGGATATGATGGAGGGTAGTGAGCATACCATAGATAAGAGGCAATACCCAGCTGAAATGCAGGTAACTTTCTTTAGATCAGAATATATGACGCAGGAAGAAGCGTTCAAACATAATGATGGAGTTGTAATGATATGTTACATTATTAAGTATGGTGTAAATCCTGACCCGCGTCTCCAGTGGGTCTTAGAAGGATTTCCTCGTATACAGGAAGCTCAAACAAATACTCGAGTTGGACCATACCCTATGTCGCGGCTCTTGCCAATGTTTTTCGAAGATTACTTTTTATATTGGGGAAGTTTAACAACCGTGAAAGGAGAAAGACACGTAGTAAGATGGCTTATACCCAGACCTACCTTATACGCTTCTTTCGATCAGATGAAGGAATTTCGTAAGTTGTGGGATCCTTGGGACGAACCCATCGTGAGAAATTTCAGACCTCTTCAAGAACGAAACGATCGTCATGTACTCTTCATCCGTCCGCACTGGAATCAATACAACTCATTATTACCAATACCAAGAATTCCAGAGCCATCAATTTCAATTTTATCTCCAGCGTACCAAGCAAATCCATGGATGCTACCTAAGCAAAATGTCGATTTACAAACTCAACCCGAGAAAAATGAAAATTGA

Protein sequence:

>DPOGS209817-PA
MYKKAAPAQTEEEEQHPKEEEIVIPDIGTLEWIYYLSELEGDLPTPIDVSITGSLKYPCPDLVWYNFEIYPHKVKITNTGHTVLLGAKWRTERPYLKGGPLLEKHIFSQVHFHWGADMMEGSEHTIDKRQYPAEMQVTFFRSEYMTQEEAFKHNDGVVMICYIIKYGVNPDPRLQWVLEGFPRIQEAQTNTRVGPYPMSRLLPMFFEDYFLYWGSLTTVKGERHVVRWLIPRPTLYASFDQMKEFRKLWDPWDEPIVRNFRPLQERNDRHVLFIRPHWNQYNSLLPIPRIPEPSISILSPAYQANPWMLPKQNVDLQTQPEKNEN-