Monarch geneset OGS2.0

DPOGS207970
Transcript         DPOGS207970-TA    1071 bp
Protein            DPOGS207970-PA    356 aa
Genomic position   DPSCF300090 + 564936-573973
RNAseq coverage    18x (Rank: top 80%)

Annotation
Heliconius         HMEL002424         3e-116   98.18%
Bombyx             BGIBMGA000322-TA   9e-141   92.70%
Drosophila         byn-PA             4e-104   82.78%
EBI UniRef50       UniRef50_E0VR50    2e-103   81.28%   Brachyury, putative n=1 Tax=Pediculus humanus corporis RepID=E0VR50_PEDHC
NCBI RefSeq        XP_002428594.1     4e-104   81.28%   brachyury, putative [Pediculus humanus corporis]
NCBI nr blastp     gi|242015927       8e-103   81.28%   brachyury, putative [Pediculus humanus corporis]
NCBI nr blastx     gi|86515408        2e-106   63.81%   brachyury [Tribolium castaneum]
Group
Gene Ontology      GO:0005634   1.2e-147   nucleus
                   GO:0006355   1.2e-147   regulation of transcription, DNA-dependent
                   GO:0003700   1.2e-147   sequence-specific DNA binding transcription factor activity
KEGG pathway
InterPro domain    [5-345]    IPR001699   1.2e-147   Transcription factor, T-box
                   [25-208]   IPR008967   1.1e-73    p53-like transcription factor, DNA-binding
                   [5-30]     IPR002070   1.8e-22    Transcription factor, Brachyury
Orthology group    MCL13074   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207970-TA
ATGGCTGCATCACATATATTAAGCGCAGTGGAACCTACGGTCGGTGGAGCTTCCAGGGCGGGAAGAGAAAGAGAAGTCAACGTGGCGTTAGACGACAGGGAGTTGTGGGTGCGGTTCCAGACTCTTACTAACGAAATGATCGTTACTAAGAACGGACGACGTATGTTTCCCGTTGTAAAAGTTACTGCCAGTGGGTTGGACCCGACGGCGATGTATACAGTTCTCCTCGAATTTGTGCAAGTGGACACTCACCGCTGGAAATATGTTAACGGAGAGTGGGTTCCCGGTGGTAAAGCAGAAGTTCCACCATCTAATGCAATCTACATACATCCGGAAAGTCCTAATTTTGGAGCTCATTGGATGAAGGAACCAATATCATTTGCTAAAGTTAAATTGACGAACAAAACCAACGGAAATGGACAGATAATGCTGAATTCTCTACACAAATACGAGCCAAGAGTACACCTAGTGAAAGTCGGAACGGATCTACGTCGTATCATGACATATCCGTTTCCAGAAACACAGTTCATAGCTGTGACGGCCTACCAGAACGAAGAAGTGACTTCACTTAAAATTAAATATAATCCCTTTGCGAAGGCCTTCTTAGATGCAAAGGAACGCCCTGAAGGTTATTACCAGAGGGATTTCGTTGGGACACATTATCCGCAGCAAAGTTCTTCACCTCATCAATATCCCCAATTTGGCGGATGGTTCGTGACGTCACAGTCTCTGTATGGCAGCAGTAACTCTTCGTCAAGAAGACCAGCGCCGTACCCTCCACGGCCGCCGTCGCGTCCTAGAACTCTCTCTCCACCGTGTGATTTGAACAGCTACGGCTACCAACCCTCGGAATATGTGGGCACTGGTGACGTAGCATATAGCACTGAGGGTATGTCGTTTGGATCTACAGGTCCTCCTTTAGAAGCGGCAACTGCTAACCAAAACGACCGGTCTACTCCGTCCGGCTCTGAACACCAAGTTGGTGAAAGAGAATATAAATATAACACTGAAGAAGAACGTCATTCACCATGTGAAGAAACCACGCTAACGCCACGTTCGCCTGCACAGTGA

Protein sequence:

>DPOGS207970-PA
MAASHILSAVEPTVGGASRAGREREVNVALDDRELWVRFQTLTNEMIVTKNGRRMFPVVKVTASGLDPTAMYTVLLEFVQVDTHRWKYVNGEWVPGGKAEVPPSNAIYIHPESPNFGAHWMKEPISFAKVKLTNKTNGNGQIMLNSLHKYEPRVHLVKVGTDLRRIMTYPFPETQFIAVTAYQNEEVTSLKIKYNPFAKAFLDAKERPEGYYQRDFVGTHYPQQSSSPHQYPQFGGWFVTSQSLYGSSNSSSRRPAPYPPRPPSRPRTLSPPCDLNSYGYQPSEYVGTGDVAYSTEGMSFGSTGPPLEAATANQNDRSTPSGSEHQVGEREYKYNTEEERHSPCEETTLTPRSPAQ-
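
The transcript and protein lengths in this record are consistent with each other: 1071 bp is 357 codons, i.e. 356 amino acids plus one stop codon. A minimal Python sketch of that check follows. It assumes the standard genetic code (NCBI translation table 1) and assumes the trailing "-" in the protein sequence marks the stop codon. The function name translate and its stop_char parameter are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_char="-"):
    """Translate a coding sequence codon by codon.

    stop_char is an assumption: the record appears to write stops as "-".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_char)


# First eight codons and last four codons of DPOGS207970-TA.
print(translate("ATGGCTGCATCACATATATTAAGC"))  # MAASHILS
print(translate("CCTGCACAGTGA"))              # PAQ-

# Length check: 356 residues plus one stop codon account for 1071 bp.
print(3 * (356 + 1) == 1071)                  # True
```

Passing the full DPOGS207970-TA sequence to translate should reproduce the DPOGS207970-PA sequence above, including the terminal "-", if the assumptions hold.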