Monarch geneset OGS2.0

DPOGS210627
Transcript: DPOGS210627-TA (954 bp)
Protein: DPOGS210627-PA (317 aa)
Genomic position: DPSCF300168 + 487672-493491
RNAseq coverage: 300x (Rank: top 37%)
Annotation
Heliconius        HMEL017415        3e-85   73.74%
Bombyx            BGIBMGA013578-TA  9e-60   55.72%
Drosophila        CG3940-PA         3e-29   33.97%
EBI UniRef50      UniRef50_A9QW25   1e-31   34.93%   Glycosyl-phosphatidylinositol-linked carbonic anhydrase n=2 Tax=Portunidae RepID=A9QW25_CARMA
NCBI RefSeq       XP_971186.1       1e-32   34.26%   PREDICTED: similar to carbonic anhydrase [Tribolium castaneum]
NCBI nr blastp    gi|91093377       2e-31   34.26%   PREDICTED: similar to carbonic anhydrase [Tribolium castaneum]
NCBI nr blastx    gi|161367859      9e-32   34.71%   glycosyl-phosphatidylinositol-linked carbonic anhydrase [Carcinus maenas]
Group
KEGG pathway      tca:659821        4e-32
                  K01674 (E4.2.1.1B, cah) maps-> Nitrogen metabolism
InterPro domain   [1-262]   IPR023561   9.4e-62   Carbonic anhydrase, alpha-class
                  [18-264]  IPR001148   8.5e-58   Carbonic anhydrase, alpha-class, catalytic domain
Orthology group   MCL26093          Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210627-TA
ATGTGGATGTTGTTTGTGCCCTTTCTGGCTGCAGCGGCTGTTTGTACCGCAGCTGAAGATTGGTCGTACGATTATGAAACCAAATGGCCGGGAGTATGCACGACTGGTGAGAAGCAGTCTCCAATAAATATAATGTCGAGAGACGCTATAGTTGACAAACTTGGGACACACATCAAGGGACCGCTGGTTTTCAGGGGATATGGTAGCGTGAACGTCAGTGGTGCCAACACCGGACACACGTTGAAATGGACGCTGGAAGAAGACGAGCCCAGCCCTGTAGTGTCCGGCGGTCCATTGAGAGGCAACTACAGCTTTGTACAATTTCATCTGCATTGGTTGTCGGAACACGCCATAGATGGAATGAAATATCCGATGGAAATTCATATGGTGCACATGAAGACTGGTCTGACCGCTGAGGAAGCAGTAGAAAGACCTGATGGCATCGTTGTTATCGGGATTCTCTGTCAGGTACACAGTGGCGAGGAAAGTGAATTTGCCCTGGGAGAACTGCAGCCATCTCTTCCGAAACTCATAGAACGTAGTGTGGGCGCTGTACCGCCGACCGTGTTGGATCTCACACGTCTCTTCAGCCCCAATATGCAATCTTTCTACACCTACCACGGCTCGATCACCACACCACTCTGTCAGGAGGTCGTAACTTGGCTAGTTATGGACAAGCCACTCATCATATCTGACACCCAGTACAAACTCTTCAGTAAAGTGGACGTCGGCGGCATCGACAATTATAGAAGTCTTCAAGCAAACAACAGAGTTATTTACCGCAGTTTAGCGTCCAGCTCTTCCATAGCTCTGCCCAGCACCATTGGCCTCCTAGCTTCATTATTCCACCTGTCTTCAGCTATGACGATGGTTTTCAGTAAAGGCGTATGCACACTCGTAAATATCAAAAAGAAATTCTTTGGACATGAAGTCAAGGAATGCAAATCTGATTAG

Protein sequence:

>DPOGS210627-PA
MWMLFVPFLAAAAVCTAAEDWSYDYETKWPGVCTTGEKQSPINIMSRDAIVDKLGTHIKGPLVFRGYGSVNVSGANTGHTLKWTLEEDEPSPVVSGGPLRGNYSFVQFHLHWLSEHAIDGMKYPMEIHMVHMKTGLTAEEAVERPDGIVVIGILCQVHSGEESEFALGELQPSLPKLIERSVGAVPPTVLDLTRLFSPNMQSFYTYHGSITTPLCQEVVTWLVMDKPLIISDTQYKLFSKVDVGGIDNYRSLQANNRVIYRSLASSSSIALPSTIGLLASLFHLSSAMTMVFSKGVCTLVNIKKKFFGHEVKECKSD-
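
The transcript and protein lengths in this record are consistent with each other: 317 codons plus one stop codon give 317 x 3 + 3 = 954 bp. As a minimal sketch of how the protein sequence relates to the nucleotide sequence, the Python snippet below translates the first ten codons and the last six codons of DPOGS210627-TA using the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative and not part of the database. Writing the stop codon as `-` is a choice made here to match the trailing `-` in the protein sequence above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to be '-',
    matching the convention used in this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First ten codons and last six codons of DPOGS210627-TA, copied from the record.
first_codons = "ATGTGGATGTTGTTTGTGCCCTTTCTGGCT"
last_codons = "GAATGCAAATCTGATTAG"

print(translate(first_codons))  # MWMLFVPFLA
print(translate(last_codons))   # ECKSD-
print(317 * 3 + 3)              # 954
```

Passing the full nucleotide sequence to `translate` in the same way should reproduce the complete DPOGS210627-PA sequence, including the final `-`.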