Monarch geneset OGS2.0

DPOGS211183
TranscriptDPOGS211183-TA768 bp
ProteinDPOGS211183-PA255 aa
Genomic positionDPSCF300007 + 496709-514358
RNAseq coverage138x (Rank: top 55%)
Annotation
HeliconiusHMEL0124262e-3447.60% 
BombyxBGIBMGA003172-TA2e-2594.74% 
Drosophilamid-PA1e-1763.08% 
EBI UniRef50UniRef50_E3XBA71e-1945.71%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XBA7_ANODA
NCBI RefSeqXP_972670.12e-2842.17%PREDICTED: similar to GA19742-PA [Tribolium castaneum]
NCBI nr blastpgi|910829194e-2742.17%PREDICTED: similar to GA19742-PA [Tribolium castaneum]
NCBI nr blastxgi|910829191e-2942.17%PREDICTED: similar to GA19742-PA [Tribolium castaneum]
Group
Gene OntologyGO:00056343.4e-18nucleus
GO:00063553.4e-18regulation of transcription, DNA-dependent
GO:00037003.4e-18sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[84-159] IPR0016993.4e-18Transcription factor, T-box
[125-164] IPR0089678.2e-09p53-like transcription factor, DNA-binding
Orthology groupMCL22674 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211183-TA
ATGGATGTGTGGCCGTCGAGGATGGTGGCGGAGGAGGGTCGCACGAGGGCCACTGACTTCTCCATAGCGGCCATCATGGCCAGGAGTTCGGATCAGCCGCGCAGCCCAGGAGCCAGTTCGCCTTATCTCGGTCCGTCTTCAAGACAGAGTTCCCCAGCATCTCTGAGCTCCCCCGCATCAAGTTGTCGTTCGCCAGCTCCTGAAGAAGATGTCGAGGTGGATGTAGAGCAGTGCTCTGATGGAGAAAGGGATGCCTCGGACACTGCTGCCCCATCACCCGCCCCTTCATCAGAGTTAGGTGAAAGGGACACCCCTTCCCCGGCGCGGCCTCTGCCAACGCCGGTGACATCTTGTAATTGTGAGGAACTGCTCTCTGTCGACTGTCAGTTGGAGACTAAGGAACTGTGGGACAAGTTCCACGACCTGGGCACGGAGATGATCATCACCAAAACTGGGAGGAACTTTCCACATCTACGTCACCTGTGTGCGGGCGACCGCGATGTTTGGGCTGGCATTCGTCACGGGGCGTCCGCTTCTACTGCCATGATTGATGACCGCTGCGAACCTGAATGGTTCTTCCCTGGTTCTAAGCGAGTGCTTCCTTGTGTGAGGCGTGATAATGTCGATGTAACGGCAAAGGCGCCGCCGCCTCAGCATCAACTCATCACAATACACCCGTTCAACACGCCCGCCATATTCGGACGTGCTCTTGACAGCTTACATGTTTACGTCTCCAGCTCCTGTCAGATATCGTGTGAACGGAATTGA

Protein sequence:

>DPOGS211183-PA
MDVWPSRMVAEEGRTRATDFSIAAIMARSSDQPRSPGASSPYLGPSSRQSSPASLSSPASSCRSPAPEEDVEVDVEQCSDGERDASDTAAPSPAPSSELGERDTPSPARPLPTPVTSCNCEELLSVDCQLETKELWDKFHDLGTEMIITKTGRNFPHLRHLCAGDRDVWAGIRHGASASTAMIDDRCEPEWFFPGSKRVLPCVRRDNVDVTAKAPPPQHQLITIHPFNTPAIFGRALDSLHVYVSSSCQISCERN-