Monarch geneset OGS2.0

DPOGS209805
TranscriptDPOGS209805-TA1716 bp
ProteinDPOGS209805-PA571 aa
Genomic positionDPSCF300117 - 201287-219600
RNAseq coverage54x (Rank: top 69%)
Annotation
HeliconiusHMEL0118690.070.93% 
BombyxBGIBMGA008042-TA1e-11980.08% 
DrosophilaCG11284-PA8e-4138.31% 
EBI UniRef50UniRef50_D6W9D11e-4640.87%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W9D1_TRICA
NCBI RefSeqXP_001651049.11e-5042.04%carbonic anhydrase [Aedes aegypti]
NCBI nr blastpgi|1571103142e-4942.04%carbonic anhydrase [Aedes aegypti]
NCBI nr blastxgi|1571103141e-4942.04%carbonic anhydrase [Aedes aegypti]
Group
KEGG pathwayaag:AaeL_AAEL0055203e-50 
 K01672 (E4.2.1.1)maps-> Nitrogen metabolism
InterPro domain[332-555] IPR0235616.4e-74Carbonic anhydrase, alpha-class
[332-555] IPR0184336.4e-74Carbonic anhydrase, putative, insect
[332-565] IPR0011481.2e-58Carbonic anhydrase, alpha-class, catalytic domain
Orthology groupMCL16791 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209805-TA
ATGTTTGCAAAGCCAAAAACTGTTTTTGTATTCCGGTTGCCGACGCAATATCCAATAGTCACTATACCAATTTTGCGGAAGTCTCGCCAATCTACATATTTTGCAAAACGGAAGAAATTGAAGAAGACGAAGAAACTAAATACTAAAGCTCCAGAATTAAAAGAATGGACGTATAAAGATCAGCACGATTGGCCGAGACGCTATCCTGATTGCGGCGGTCGATCGCAATCTCCAGTCAACTTGCCTTATACACCACTCGTAAAGGCTAAAGAAAGCCGACAGCTTATGTTTCTTAACTATGACGTATTACCAAAGAAACTCATGCTGTGCAATGACGGAAAACGAATTGCTTTATATGGAGAATGGAAGCCCATAAATCAGCCTCTTATTTACGGAGGTGCAGCTCATAGCCGTCGATACTTATTTCATTCCTTAACACTTCATCGGCCTTCGGAACACAGAATAGGTGGTCTCCAATTTCCAATGGAAACTCAAGTGCTTTTCATTTCTGCGGAATACAAATCTTTTGCAGAAGCCATCAAAGCTTCCCTTAAAGATGCTCAGGCCTTCCTCGGTATTGTTAATATATACAAGTACGACAACCACACACAGCAAGGCTTGGAAGAATTACTAAAAGCGGGAACCAAACGCTTCAACACCTCCATGTCACTTCTACCATTAGGCTTCTTCACTCCTCCGTTACAGCAATATGCTTGTTATCAAGGATCATTAACTTTTCCCCCTTGCACTGAATCGGTTTTGTGGTTAATAAGAGCGAAGGCTTTACCTATTACAAGGAAGGCTATGGATGCGGCTAGCAGTATTTTTGAAGAAGATCATGTGGGATCTTGTCTAAGAGAACCGCAGCCTCTTAACGATAGAAGATGCAAGTGGTGGTTATTTAAGATGAGATCCAAATCTATTTCGTTAAAAGTAAGCGCATCAAATACCGTCGCTTCAGATGCTGAACAAATATCAAAGCTCAGAGCATCACAGTCGCCGATTGCAATTTCACTCCAACGATGCCCCACTTGGTCCTCTTTAGATCCTCTAGTATTTAAAGGATATTGGGACAATAATTCCAATGGCATTCTTGTCAATACTGGACAAACAGCTTATTTTACATTCGACACACCGTCTCGGCCGCGACTGAGCGGAGGTCCACTTATCGGTGAATATATTTTTGAACAAATGCACTTCCACTGGTCGGTTGATGATTTCACAGGATGCGAACATGTCCTTGACGGTCACGGTTACGCTGCGGAGTGCCACCTTGTACATTACAATAGCAAATACCAGTCACTCGAGGCAGCTGTGCCTCACGCAGATGGTTTGGCTGTAGTTGGATATTTATTGGAAGCAGTCGATGCACCGAACCCGAACTTTGATATGTTCATTGAGGGCTTGGAACAGATTAAGAAACCGGACCACAGTGTTGCTCTATCAGCAGAGTCTTTGGCTTGGATGAACAGAGAGGATGTGACCAACGGTAGCTACGTCACTTACAAAGGATCTTTGACAACGCCACCTTATGGAGAATGTGTCACGTGGATCATTTACGAGAAAGCAGTACAAATTGGTAGCGAACAGCTGGGGCTTTTAAGACAATTGGAAGGAGCAGACAGTGTACCAATTGAGAGAAATGTGAGGCCTACACAGCGGCATCCACCAGGACATTCTGTTATATATGTTAAACAAGTAAAGTCGAAGCTTTGA

Protein sequence:

>DPOGS209805-PA
MFAKPKTVFVFRLPTQYPIVTIPILRKSRQSTYFAKRKKLKKTKKLNTKAPELKEWTYKDQHDWPRRYPDCGGRSQSPVNLPYTPLVKAKESRQLMFLNYDVLPKKLMLCNDGKRIALYGEWKPINQPLIYGGAAHSRRYLFHSLTLHRPSEHRIGGLQFPMETQVLFISAEYKSFAEAIKASLKDAQAFLGIVNIYKYDNHTQQGLEELLKAGTKRFNTSMSLLPLGFFTPPLQQYACYQGSLTFPPCTESVLWLIRAKALPITRKAMDAASSIFEEDHVGSCLREPQPLNDRRCKWWLFKMRSKSISLKVSASNTVASDAEQISKLRASQSPIAISLQRCPTWSSLDPLVFKGYWDNNSNGILVNTGQTAYFTFDTPSRPRLSGGPLIGEYIFEQMHFHWSVDDFTGCEHVLDGHGYAAECHLVHYNSKYQSLEAAVPHADGLAVVGYLLEAVDAPNPNFDMFIEGLEQIKKPDHSVALSAESLAWMNREDVTNGSYVTYKGSLTTPPYGECVTWIIYEKAVQIGSEQLGLLRQLEGADSVPIERNVRPTQRHPPGHSVIYVKQVKSKL-