Monarch geneset OGS2.0

DPOGS206757
TranscriptDPOGS206757-TA906 bp
ProteinDPOGS206757-PA301 aa
Genomic positionDPSCF300316 + 73179-81324
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0111754e-10685.00% 
BombyxBGIBMGA009730-TA2e-8671.82% 
Drosophilalms-PA1e-3648.31% 
EBI UniRef50UniRef50_D6WRD68e-4449.74%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WRD6_TRICA
NCBI RefSeqXP_001122451.13e-4549.34%PREDICTED: similar to CG13424-PA [Apis mellifera]
NCBI nr blastpgi|2700101063e-4349.74%hypothetical protein TcasGA2_TC009463 [Tribolium castaneum]
NCBI nr blastxgi|2700101065e-4246.52%hypothetical protein TcasGA2_TC009463 [Tribolium castaneum]
Group
Gene OntologyGO:00063555.1e-25regulation of transcription, DNA-dependent
GO:00435655.1e-25sequence-specific DNA binding
GO:00037005.1e-25sequence-specific DNA binding transcription factor activity
GO:00036775.9e-25DNA binding
GO:00055157.5e-22protein binding
KEGG pathway 
InterPro domain[87-149] IPR0013565.1e-25Homeobox
[78-146] IPR0122875.9e-25Homeodomain-related
[78-156] IPR0090577.5e-22Homeodomain-like
[109-120] IPR0204792.2e-06Homeobox, eukaryotic
Orthology groupMCL22221 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206757-TA
ATGGAAGTTAAACCGGAAATGATTGGAGCTGATGATATTAAGGTTGACAAAATCGACAAATTACCGTTTTCTATAGACAGTTTGCTGGCGGACAAGAAGAATAGTGTTGGGACATCCGATTCAGATTTAAATGTGTTCAAAGAAAATTATGACGATGAGAGCAGTGGGTCGGAACAATTGGATGTGGAGACTTCTACGATTGACGTTCAGGAGTTTGCTGATGCTAGAGCGGATTATCAGACAGGTTCATGTTCACGCGGGAAGAGAGCCCGCACCGCTTTCAGTGCTCAACAGATAAAAAGTTTGGAAGCGGAATTTGAGAAGAACAGATACCTCTCAGTAGCTGCTAGAGGACGTCTAGCGAGACAGCTGAGACTCACAGAGACACAGATTAAAATATGGTTCCAAAATCGCCGTACGAAATGGAAACGTAAATATACAAACGACGTGGAGATATTGGCTCAGCAGTATTACAATAGTTTGGGCATAATAACCCCAAGGCCGATGTTTGTTGGTGATAGGCTGTGGATTTTCAACTATCCCAATCGTTTACAACCAACTCAACAACAACAATGGGCGAAATCGCTCAACAATATCGCTGGAGTTTCACACACGCCGATAGACAGAACCGTGTACAGGAGTTTATCTAAGCCGGATATACCATTTTTGTTGTCGCCTCCACCGCCGTATACAAACGAGCGTGTTCTATTAGAGAGAAGTGTCAGTTTGACAGGTGTCAAAGTTCCCGCTCAAACTCGCCTACCATCTCTACCCAGAGAGGTATACAGCAATATGGCAACGGCACAAAGGAGTTTGGATATGTACAAGAATAATATAATAAGCCACAGTCAGCAGGAATCGAATGTTGATCATTTAAGAAGATTGGAAGAGAATTTCAGTATTTAG

Protein sequence:

>DPOGS206757-PA
MEVKPEMIGADDIKVDKIDKLPFSIDSLLADKKNSVGTSDSDLNVFKENYDDESSGSEQLDVETSTIDVQEFADARADYQTGSCSRGKRARTAFSAQQIKSLEAEFEKNRYLSVAARGRLARQLRLTETQIKIWFQNRRTKWKRKYTNDVEILAQQYYNSLGIITPRPMFVGDRLWIFNYPNRLQPTQQQQWAKSLNNIAGVSHTPIDRTVYRSLSKPDIPFLLSPPPPYTNERVLLERSVSLTGVKVPAQTRLPSLPREVYSNMATAQRSLDMYKNNIISHSQQESNVDHLRRLEENFSI-