Monarch geneset OGS2.0

DPOGS204254
TranscriptDPOGS204254-TA891 bp
ProteinDPOGS204254-PA296 aa
Genomic positionDPSCF300046 - 373961-375310
RNAseq coverage120x (Rank: top 58%)
Annotation
HeliconiusHMEL0151884e-7395.65% 
BombyxBGIBMGA007528-TA2e-6890.58% 
DrosophilaCG14881-PA7e-4734.12% 
EBI UniRef50UniRef50_E2AG803e-5441.24%Armadillo repeat-containing protein 7 n=10 Tax=Neoptera RepID=E2AG80_CAMFO
NCBI RefSeqXP_001120760.13e-5043.20%PREDICTED: similar to CG14881-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3287843353e-5943.30%PREDICTED: hypothetical protein LOC724856 [Apis mellifera]
NCBI nr blastxgi|3287843353e-6243.30%PREDICTED: hypothetical protein LOC724856 [Apis mellifera]
Group
Gene OntologyGO:00054886.3e-19binding
GO:00081525.3e-11metabolic process
GO:00164915.3e-11oxidoreductase activity
KEGG pathwayhar:HEAR15165e-12 
 K00625 (E2.3.1.8, pta)maps-> Propanoate metabolism
    Taurine and hypotaurine metabolism
    Pyruvate metabolism
InterPro domain[22-163] IPR0160246.3e-19Armadillo-type fold
[25-161] IPR0119894.6e-13Armadillo-like helical
[180-270] IPR0025395.3e-11MaoC-like dehydratase
Orthology groupMCL12412 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204254-TA
ATGTTTAGCAGTAAAACGCAACTAAGAAAGAGAACACCAGAAAATGGAACGGATCGCGAAAGTTATTTATCACTTTTAGTCGATGAATACATTAATTCTTCATCGTTTGATGCCAAACATCAGGTTTTAGCAAATCTTGCTAACTTTGCTTACGACCCAATAAACTACGGTTTTATAAGAAATGTTGGTGTCTTAGATATATTTATCCATGTGCTAAAGAATGCAAATAACAGTAAACTGCTTCATTTTGCCACAGCAGGAATCTGCAACTTGTGTATAGATCCAGATAATGTTGATTATATTGTGAATAATGGAGCTATTGAACCTATTTCCACTTTACTCAACTCCGACCATGAAGAAACATTAGCTGATGCTATAACAATATTTATTTACTTATATGACAGTCATGCAAAAGATAAAATACATGACATAAAAACAGTTAAAAATATTGAAGCACTTATTAAATCAGAAAATCAAAAGATCTGTAAGTTTCAATCAACATTCAGCCGTGCGGCTTTCAAAGCTGGTGATAAAATAAGAATACAGAAAACTCTAACACAGAAAGATCTGGATACATTCTCAAATTTGACGAGCGATCACAATTACCTCCATAAGAATAATGGCAACAAAAGACCAATTGTTCATGGGGCGTTTTTGAATGGACTCGTCGCTGGGCTTATTGGAACCCATCTGCCTGGACCTGGCACTGTGCTAGTGTCACAGACCATGAAATTTCCTAATAAATGTTTTGTTGGAGAGAAGCTGACTATAAGTGTGGAGCTGGTCGATGTTAGAAAAATACTCAAGGTTAAATTTTTCTGCATTGTTGAAGAGGAAAAAAAGGTTGTGTTTGAAGGTGAAGCAAAGTTGATGCTTGCTAAAGACTGTTAA

Protein sequence:

>DPOGS204254-PA
MFSSKTQLRKRTPENGTDRESYLSLLVDEYINSSSFDAKHQVLANLANFAYDPINYGFIRNVGVLDIFIHVLKNANNSKLLHFATAGICNLCIDPDNVDYIVNNGAIEPISTLLNSDHEETLADAITIFIYLYDSHAKDKIHDIKTVKNIEALIKSENQKICKFQSTFSRAAFKAGDKIRIQKTLTQKDLDTFSNLTSDHNYLHKNNGNKRPIVHGAFLNGLVAGLIGTHLPGPGTVLVSQTMKFPNKCFVGEKLTISVELVDVRKILKVKFFCIVEEEKKVVFEGEAKLMLAKDC-