Monarch geneset OGS2.0

DPOGS207664
Transcript: DPOGS207664-TA (1191 bp)
Protein: DPOGS207664-PA (396 aa)
Genomic position: DPSCF300133 + 131986-133176
RNAseq coverage: 256x (Rank: top 41%)
Annotation
Heliconius: HMEL002676  0.0  93.44%
Bombyx: BGIBMGA010525-TA  0.0  82.88%
Drosophila: CG1620-PA  1e-40  41.67%
EBI UniRef50: UniRef50_D7EL50  3e-98  62.65%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EL50_TRICA
NCBI RefSeq: XP_970158.1  5e-99  62.65%  PREDICTED: similar to mesoderm induction early response 1 [Tribolium castaneum]
NCBI nr blastp: gi|91095017  1e-97  62.65%  PREDICTED: similar to mesoderm induction early response 1 [Tribolium castaneum]
NCBI nr blastx: gi|91095017  5e-113  60.85%  PREDICTED: similar to mesoderm induction early response 1 [Tribolium castaneum]
Group:
Gene Ontology: GO:0005515  2.5e-12  protein binding
Gene Ontology: GO:0003677  8.6e-07  DNA binding
KEGG pathway:
InterPro domain: [251-307]  IPR009057  2.5e-12  Homeodomain-like
InterPro domain: [151-201]  IPR000949  6.2e-12  ELM2 domain
InterPro domain: [250-299]  IPR001005  8.6e-07  SANT domain, DNA binding
Orthology group: MCL14868  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207664-TA
ATGTCAGACTGTGCGCTGGTAACCAGTGTAAGCGAACACGATGCTAGTATGGATGTGGGAAACGACAAATCCCTCTTCGAGCCGACTATTGATATGATGGTAAATGATTTCGACGACGAGAGAACATTAGACGAAGAAGAAGCTTTGGCGGCAGGCGAGCAACAAGATCCGAAAGCGGAACTTAATAGCTTGCAACGTGAAGGTGATATGCCTTTAGAGGAATTACTTGCATTGTATGGTTATAACAGAGGTATGGATAAAGCAAGCCCTGAACAACCACCAGAGGTGGTACCGGAAGAAAATGAGAAAGCTGAGTCTGCTCTACAGCAGTTATACACTGAGACCACAAGCCCTGAAGCCACACGGTGTCTCCGCTCTGGCTCAAGGCCTCCTTCTGAAGAAGAAGATGATTATGACTATAGTCCCGATGAGGATGACTGGAAAAAAACTATCATGGTAGGTAGTGATTATCAAGCTGGTATACCAGAAGGTCTCTGCAGTTATGATGATGCTTTGCCATATGAGAATGAAGATAAATTGTTGTGGAACCCAAGTGTCCTTGATGAAAAGGTGATAGAAGATTATATGAGAAAAATATGTGCTATGAATTCCGGCACAGGTATTGATGCTGTGCCTAGAGGAAAGCAGCTGAGAGATGATGAAGAAGCATTGTTCCTATTGCAACAATGTGGTCATAATGTTGAGGAAGCTCTCAGGAGGAGAAGAATATCGGCACAAACCCCTGCCCACGCCAGTGTATGGTCCGAAGAGGAATGCAGAAACTTTGAAAACGGTATCAAAGTTCACGGCAAGGACTTTCACTTAATACGCCAACAGAAAGTCAGGACGAGATCTGTTGGGGAGCTAGTACAATTTTATTATATCTGGAAAAAAACTGAACGACATGATATATTTGCTAACAAGACGAGACTAGAAAAGAAAAAATACACACTACATCCTGGGCATACCGATTATATGGACAGATTTTTGGAGGAACAGGAAGCTACAGGGGCTGCTAATGTCGTCCGACCTGTCTCTCCGTCCCCTATGATGGTGTATGTACCTTCACCGGCCACCCAGCCGGATCCCTTGGCTTTGGGAGAGAAAGAGGTTTTCTCTCAATTAAATCCCCATACTACACCACCAAGAACCCTCTCCATCGAGGATCAAGAACCAGACGTTGTTTCCTAA

Protein sequence:

>DPOGS207664-PA
MSDCALVTSVSEHDASMDVGNDKSLFEPTIDMMVNDFDDERTLDEEEALAAGEQQDPKAELNSLQREGDMPLEELLALYGYNRGMDKASPEQPPEVVPEENEKAESALQQLYTETTSPEATRCLRSGSRPPSEEEDDYDYSPDEDDWKKTIMVGSDYQAGIPEGLCSYDDALPYENEDKLLWNPSVLDEKVIEDYMRKICAMNSGTGIDAVPRGKQLRDDEEALFLLQQCGHNVEEALRRRRISAQTPAHASVWSEEECRNFENGIKVHGKDFHLIRQQKVRTRSVGELVQFYYIWKKTERHDIFANKTRLEKKKYTLHPGHTDYMDRFLEEQEATGAANVVRPVSPSPMMVYVPSPATQPDPLALGEKEVFSQLNPHTTPPRTLSIEDQEPDVVS-
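
The lengths listed in this record can be cross-checked against one another: the genomic span 131986-133176 covers 1191 positions, which equals the transcript length, and 396 residues plus one stop codon account for 1191 bases. The Python sketch below illustrates that check and translates the first and last codons of DPOGS207664-TA. It is an illustration only, not part of the geneset pipeline; `translate` and `span_length` are hypothetical helper names, and it assumes the standard genetic code (NCBI translation table 1) and that the trailing `-` in the protein sequence marks the stop codon.

```python
# Standard genetic code (assumed: NCBI translation table 1), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence read from its first base.

    Stop codons are written as stop_symbol (the record uses '-');
    codons containing non-ACGT characters become 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1191
protein_aa = 396

# Genomic span equals the transcript length.
assert span_length(131986, 133176) == transcript_bp
# 396 codons plus one stop codon account for the whole transcript.
assert 3 * (protein_aa + 1) == transcript_bp

# First eight codons and last five codons of DPOGS207664-TA.
print(translate("ATGTCAGACTGTGCGCTGGTAACC"))  # MSDCALVT
print(translate("GACGTTGTTTCCTAA"))           # DVVS-
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS207664-PA sequence would extend the same check to every residue.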