Monarch geneset OGS2.0

DPOGS205688
TranscriptDPOGS205688-TA1806 bp
ProteinDPOGS205688-PA601 aa
Genomic positionDPSCF300250 - 153491-158002
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0072957e-8046.26% 
BombyxBGIBMGA009825-TA1e-9254.17% 
DrosophilaCG3711-PA5e-1525.08% 
EBI UniRef50UniRef50_F4X1B39e-9437.46%Transcription factor Sp4 n=4 Tax=Acromyrmex echinatior RepID=F4X1B3_ACREC
NCBI RefSeqXP_002425048.16e-7230.09%kelch repeat domain, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3320192763e-9337.46%Transcription factor Sp4 [Acromyrmex echinatior]
NCBI nr blastxgi|3320192765e-9537.52%Transcription factor Sp4 [Acromyrmex echinatior]
Group
Gene OntologyGO:00055155.4e-28protein binding
KEGG pathwaytet:TTHERM_004943608e-13 
 K00012 (E1.1.1.22, ugd)maps-> Starch and sucrose metabolism
    Ascorbate and aldarate metabolism
    Pentose and glucuronate interconversions
    Amino sugar and nucleotide sugar metabolism
InterPro domain[11-321] IPR0110431.6e-34Galactose oxidase/kelch, beta-propeller
[125-322] IPR0159155.4e-28Kelch-type beta propeller
Orthology groupMCL16997 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205688-TA
ATGTGGGTCTCTGTGGAGGGTTCAGCGGCGGTGGCGCCCTGTGCTCGTGGGAAACACTCGGCCACTCTGCTGGGCGGGTATGTGTATGTGCTGGGCGGCAGAGGAGCTGGTGGCGCGGTGCCCTTGAGGGACTTCTGGAGGTATTGTCTAGCAACAAGTAAATGGGAGCGTTTGGAGGCTAGAGGTGAACCTCCGCCCGCCCTTCAGGAGCACAGCGCCACCGCCCATCATGACAAGCTGTACGTGTTCGGAGGGGAGGCTGGGGCACTGGCAGAGACCCCGCTTTGGATATACGACACAACGATAGAGAGCTGGCGCAAGTTGTCCGGGCAGGCGAGCTCGGCGAGTCGCGCGAGGCGCGTGTCGCCGCGCGTGTCCCGCACCGCGAGTGCGCCGCGCGGCCGCCGAGGACACTCCGCACACGCGCTCAAAGACTGTCTCCTTATATACGGAGGCTATAAAGATCTGAGAGGGTCTACTAATGAACTCTGGGCTTTCCATTATGAGTCGGAGTCGTGGCAGGAGGTCCGCACGGCTACAGTGGGTCCGTCGCGCCACCGACACGCGGCCGCGCTCTACGACGCTCGCCTGTACGTGCACGCGGGTCAGTGCGACCTCCGAGATTGCAGCGACCTCTGGCACTACGACACCAGTGAGTCTTCCCCCGTCCCCCTCCACATGCTTTCGGCTGCTTCTTTCCTCGTCTCTCATGTTCTGCGTGTGTTTGTTCCAGTGTCTCGGGTGTGGACGCAGGTCCGTACTCCTGCCAAGACGTCCCCGAGCGCTCGCTCCGGCCACGCGGGGCTCCGGGCCGGGGCACATTTCTATATCTTCGGAGGAGAGGCTCATGGACACCCAACCAACGAACTGTGGCGCTTCCACTTCGCGACAGAAACTTGGGAGCGGATCCTGCAAAGTGTGAAGTGGCCGTCGGCCCGGGTAGACAGTCGCGCCCTGCTGGTGGGTGGGGCCGCCCCCCGCCTCCGCCCCACCCGCCCTGCCCCCGCGCCGCCCTCGTCCGCGCCGGTCCGCGAGGCGCCGAGCGGCTTCTTACGTGAGATCTCCAAGCTCTCCAGCTTCCACATCCGGCGCGCGGCGCGGTGCTCATACAGCGTGCTGGCCGGCGACCAGGACTCCACCGAGAGTCTGGTGCGGACGGAGCACTCCTCGCTGTCCAAGTCGCGCTCGGCCTACGTCATCGACGAGCGCCAGCCGGCCGACGGTGACGAGGACCCGCGGGGAACCTCCGACCTCGCCAGGGAACCGATCTCGGTCCCGGACTTCGCGGACATGATCCTGCCGACGCCCGTGCTGTCACCCGTCCAGACCACCAAACTGGTGTACCTCGACTCCGAGGAGGAGGAGGATATGAAGAGAGAAACCGAGCCCTACAAGAGAGAGAAGAACGGCACGCTCTCGCGATGTAGAGTCGGTCCGATGCCGAAATCCGCTTCGGTGAAGTTCACGACGCAGAGAGTGGCGGAAACGAGTGTGGCGACTGAGGACGAGGGCGACCTGTCCACGTCCGACTACGCGAGCGCCGAGCGAGTTCACAGGCTGTCGGCCGTGTCCTCGGGGTTCAGCAACCCTCACTACCTCGGGCCGGACGTTAGGAATTTAGGATCAGCCTTGACCCCGGACTCGGGCGTCGCTCCCGGCGACATCGAGCTGCAGGACCTGAGCGCGCGGAGGCCGGCGACGCGCGAGGAGCGACAACTGCACCTACTACTGGTGGGAGGCCGCGAGCCGCCTCACCCCGCGCTGCTGCAGAGGCCGCTCTCTCTGTGGAGCTACCGGCTACTGTAA

Protein sequence:

>DPOGS205688-PA
MWVSVEGSAAVAPCARGKHSATLLGGYVYVLGGRGAGGAVPLRDFWRYCLATSKWERLEARGEPPPALQEHSATAHHDKLYVFGGEAGALAETPLWIYDTTIESWRKLSGQASSASRARRVSPRVSRTASAPRGRRGHSAHALKDCLLIYGGYKDLRGSTNELWAFHYESESWQEVRTATVGPSRHRHAAALYDARLYVHAGQCDLRDCSDLWHYDTSESSPVPLHMLSAASFLVSHVLRVFVPVSRVWTQVRTPAKTSPSARSGHAGLRAGAHFYIFGGEAHGHPTNELWRFHFATETWERILQSVKWPSARVDSRALLVGGAAPRLRPTRPAPAPPSSAPVREAPSGFLREISKLSSFHIRRAARCSYSVLAGDQDSTESLVRTEHSSLSKSRSAYVIDERQPADGDEDPRGTSDLAREPISVPDFADMILPTPVLSPVQTTKLVYLDSEEEEDMKRETEPYKREKNGTLSRCRVGPMPKSASVKFTTQRVAETSVATEDEGDLSTSDYASAERVHRLSAVSSGFSNPHYLGPDVRNLGSALTPDSGVAPGDIELQDLSARRPATREERQLHLLLVGGREPPHPALLQRPLSLWSYRLL-