Monarch geneset OGS2.0

DPOGS208711
Transcript: DPOGS208711-TA (765 bp)
Protein: DPOGS208711-PA (254 aa)
Genomic position: DPSCF300043 - 104757-106005
RNAseq coverage: 117x (Rank: top 58%)
Annotation

Heliconius        HMEL015257        2e-144   92.91%
Bombyx            BGIBMGA003357-TA  1e-88    80.34%
Drosophila        CAHbeta-PA        7e-106   67.45%
EBI UniRef50      UniRef50_Q9VHJ5   8e-104   67.45%   CG11967 n=18 Tax=Bilateria RepID=Q9VHJ5_DROME
NCBI RefSeq       XP_970970.1       2e-111   69.80%   PREDICTED: similar to carbonic anhydrase [Tribolium castaneum]
NCBI nr blastp    gi|91084165       4e-110   69.80%   PREDICTED: similar to carbonic anhydrase [Tribolium castaneum]
NCBI nr blastx    gi|270008060      1e-106   69.80%   hypothetical protein TcasGA2_TC014816 [Tribolium castaneum]
Group

Gene Ontology     GO:0015976            1.7e-52   carbon utilization
                  GO:0008270            1.7e-52   zinc ion binding
                  GO:0004089            1.7e-52   carbonate dehydratase activity
KEGG pathway      dme:Dmel_CG11967      5e-104
                  K01672 (E4.2.1.1)     maps-> Nitrogen metabolism
InterPro domain   [170-237] IPR001765   1.7e-52   Carbonic anhydrase
Orthology group   MCL16694              Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208711-TA
ATGGACAGAATATTAAGAGGTATTATGAGATACCGAGTGTTAGATCGAGCGACTATGGTCAAACAGTTTCAACAAGTCAAAGACAACCCCGTGCCAAAAGCTATTTTTTATACATGTATGGACAGTAGAATGATTCCAACTAGATTTACTGAGACATGCGTGGGGGATATGTTCGTTATCCGAAATGCTGGAAATCTCATACCTCACTCTCGACACTTTGTAGATGAAATGACCAGTTGTGAACCGGCTGGTTTAGAACTTAGTTGTATCGTAAACGATATTAAGCACGTCATTGTATGCGGACACAGTGATTGCAAGGCAATGAATCTTCTCTATAAGTTAAAAAGTGCGGACGAATCAAATTTAGAACAAAGAAGAATCTCTCCACTAAAATCTTGGCTCTGTGCTCATGGCAAATCTAGTTTAAACAAATTCTTAGATGTGAAAGGTGATTTTAATAAGCCTATTTTATTTTCTGCTGAAACACCACAGCGAAAATTTGTTGCTTACATTGATCCCGAAAATCAATTTTGTATAGAAGATAAATTATCGCAGGTTAACACTTTACAACAATTGCAAAATATTGCGTCTTATGGCATGTTAAAAAAACGGCTCGAGAAGCATGATTTACACATTCATGCTTTATGGTTTGATATTTATACTGGTGATATATACTATTTCAGCAGAAGAGCTAAAAGATTCCTTATAATAGATGAAGCTAGCTACGAAGTTATTCTAGCCGAAATTCGAAGATATTACTCCTAG

Protein sequence:

>DPOGS208711-PA
MDRILRGIMRYRVLDRATMVKQFQQVKDNPVPKAIFYTCMDSRMIPTRFTETCVGDMFVIRNAGNLIPHSRHFVDEMTSCEPAGLELSCIVNDIKHVIVCGHSDCKAMNLLYKLKSADESNLEQRRISPLKSWLCAHGKSSLNKFLDVKGDFNKPILFSAETPQRKFVAYIDPENQFCIEDKLSQVNTLQQLQNIASYGMLKKRLEKHDLHIHALWFDIYTGDIYYFSRRAKRFLIIDEASYEVILAEIRRYYS-
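
The transcript and protein lengths listed above are consistent with each other: 254 codons plus one stop codon give 3 x 255 = 765 bp, and the trailing "-" in the protein sequence marks that stop. The following is a minimal Python sketch of how such a check could be done, assuming the standard genetic code applies and that the transcript is a plain coding sequence in frame from its first base. The function names `translate` and `expected_cds_length` are hypothetical helpers introduced here for illustration and are not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (assumption: no alternative code table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)


def expected_cds_length(protein_length):
    """Length in bp of a CDS encoding protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First 30 and last 12 bases of DPOGS208711-TA, copied from the record.
    print(translate("ATGGACAGAATATTAAGAGGTATTATGAGA"))
    print(translate("TATTACTCCTAG"))
    print(expected_cds_length(254))
```

Passing the full DPOGS208711-TA sequence to `translate` and comparing the result with DPOGS208711-PA extends the same check to the whole record.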