Monarch geneset OGS2.0

DPOGS210302
TranscriptDPOGS210302-TA2235 bp
ProteinDPOGS210302-PA744 aa
Genomic positionDPSCF300305 - 34936-61918
RNAseq coverage428x (Rank: top 29%)
Annotation
HeliconiusHMEL0171650.064.56% 
BombyxBGIBMGA013826-TA1e-11863.69% 
DrosophilaMyb-PA3e-8438.38% 
EBI UniRef50UniRef50_D6WRL95e-10443.64%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WRL9_TRICA
NCBI RefSeqXP_002431279.14e-10442.21%pre-mRNA-splicing factor cef1, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3825462447e-11663.69%Myb transcription factor [Bombyx mori]
NCBI nr blastxgi|3825462447e-11563.99%Myb transcription factor [Bombyx mori]
Group
Gene OntologyGO:00055155.5e-33protein binding
GO:00036777.3e-24DNA binding
GO:00063557.3e-24regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[71-497] IPR0154952.7e-101Myb transcription factor
[120-218] IPR0090575.5e-33Homeodomain-like
[128-173] IPR0122877.3e-24Homeodomain-related
[119-168] IPR0010058.7e-18SANT domain, DNA binding
[120-166] IPR0147782.1e-16Myb, DNA-binding
[445-546] IPR0153959.5e-14C-myb, C-terminal
Orthology groupMCL12577 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210302-TA
ATGCAATTTCAAAGTAATGAAGAAGACGATTTTCAATATCGGATTTTTGGAGAATCCTTCAACTTTCCAACCGAAAACAAATCGACCGGCCGGCGTAAAAAACGCAGCGGCTATGATTCAGAGTCGAGTGACTATTCAGAGGATGACACGTATGAAGATGTACCGCCTCCAACCAAAGGCTCTGGGCCGCGGAAGAATATCAATAGGGGAAGGTGGACCAAAGATGAGGACAAACGTTTGAAGGTTTACGTTAAGATGTACAATGAGAATTGGGAGAAGATAGCGAGTCAGTTTCCTGATAGATCTGATGTTCAGTGTCAACAGAGGTGGACCAAGGTTGTCAACCCTGAACTAGTCAAGGGTCCCTGGACGAAAGAGGAAGACGAAAAAGTTATGGAGTTGGTAGCGAAGTACGGACCAAAAAAATGGACTCTCATAGCGCGACATCTCAAAGGCAGAATAGGGAAACAATGCAGAGAGCGTTGGCACAATCATCTCAATCCTTCAATAAAGAAAAGTGCGTGGACGGAGCATGAAGACAGAGTCATATATCAAGCTCACAGACAGCTTGGAAACCAATGGGCGAAAATAGCGAAGCTCTTACCCGGAAGAACTGACAATGCTATAAAGAACCATTGGAACTCAACAATGAGAAGAAAATACGAGCCGGAGTTACTTGACAGTTTTGAACACTTGAGGAAGAAGAAACGAAAGGAAGAAGATACACAACACAATGATGTGAGTCAAACATTGAATATACTTACCACGGTGCTATTACCCGACTTCATCGACAGAAGGACATGGTCGGACACACTGAACGAGTCGAGTCAGTCATCAAGTGCACCGGCGGTTCATCTTCGACAGTTGTTAAGAGAGAGGGCACGAGGCTCACTGGCACCAGCGGATAGCGTCGAGATAGTCGATTCGCCGTTTAGATTCGTCAATTTAGAGTCACTGCCTTTGAATTCACCAGTGAAGAATTATTTGAGTCAAGCCTCCACAAGCGATAATAACAGTCAAAATACAATAACTTACTCAATACAATCTGAAGCTGATAAAGAAATAGTGGTACCGTCAATATACTCACCTCGAGACTCCCCACCGCCTATATTAAGAAGGGCCAAGCGGAAAACAGCTGACACCACAACACCTTCAAGACAACCATGGTCGGATCCTCTATCTCGTGTAATGGAGAGCGGTGCTCCTCTACAAGCATTACCGTTCAGCCCATCTCAGTTCCTGGTAGCGCCGCTTGGGAAACAGGACGCGACCCCGCTTAGGACTAAGGAATTGGGTGTAAGTCCTCTATTGCACACGCCGACACCGAACCTAACGCCTGGCGGAACACAGTTTGAACAGAAACATACACCTAAAACCCCGACCCCGTTCAAGTTAGCGATGGCCGAAATTGGCAAGAAGTCAGGTTTGAAATATGAACCGTCCAGCCCGGAGCTGCTGGTGGAGGACATCACTGAGATGATACAGCGAGAGAACTCCGACAGTCTACACGAGTGTCTACTGTCACACGACCAAGAAGCGCAGATGAGTTCAGACTCCGGTATATCAAGCGTGCAGCGCGGTAAAGAGAACGTTCCTGGTGTACGTCGTTCGCGTAAAGCGCTCGCACATACATGGGGAGCCGCTACTAGCACGCCGCGAGCGAGACTGCACGTGCCCGATGTATGCTTCGGAGTTGAGACGCCGAGTAAGACCCTGGCCGGCGACAGTTCCGTTCTATTCTCGCCGCCATCGATAGTGAAGCATTCCCTCCTAGAAGAATCTACAAGCATCATATCAGAGAACACGCCAGAAGCCTACGAAGAAATTAAGGTACAACACAGGTGTCATGATCCTAACCCTAACTCCTACTACGAGCCTGTGTTTAGGAATATCACTAACGAATCGCCTAGAACATTCGCTAGGTTAAACGCGGACAGACTAAAAACTATAACAAGCGCAAATCTATGTGACAGCTCGTTGGCGCCAAAAGATGTTCTTACGAACGCGTTAGTTGATCACATATACAAAGTGACCAATCAGAAGCCGTCTACGTCACGCATTGACGAAATATTGACCAATCAGAAACCTTCTACGTCACACATAAACGATATGTCTACCAATAAAATTAGTAAACCAAATGACACGGAAAAAGAGAACCAATGGTGGCAGGTTGAGAAAGATTCCGGTATAGATGATCTTTACAACGACTATTACATTTTTACTAATAATATATAA

Protein sequence:

>DPOGS210302-PA
MQFQSNEEDDFQYRIFGESFNFPTENKSTGRRKKRSGYDSESSDYSEDDTYEDVPPPTKGSGPRKNINRGRWTKDEDKRLKVYVKMYNENWEKIASQFPDRSDVQCQQRWTKVVNPELVKGPWTKEEDEKVMELVAKYGPKKWTLIARHLKGRIGKQCRERWHNHLNPSIKKSAWTEHEDRVIYQAHRQLGNQWAKIAKLLPGRTDNAIKNHWNSTMRRKYEPELLDSFEHLRKKKRKEEDTQHNDVSQTLNILTTVLLPDFIDRRTWSDTLNESSQSSSAPAVHLRQLLRERARGSLAPADSVEIVDSPFRFVNLESLPLNSPVKNYLSQASTSDNNSQNTITYSIQSEADKEIVVPSIYSPRDSPPPILRRAKRKTADTTTPSRQPWSDPLSRVMESGAPLQALPFSPSQFLVAPLGKQDATPLRTKELGVSPLLHTPTPNLTPGGTQFEQKHTPKTPTPFKLAMAEIGKKSGLKYEPSSPELLVEDITEMIQRENSDSLHECLLSHDQEAQMSSDSGISSVQRGKENVPGVRRSRKALAHTWGAATSTPRARLHVPDVCFGVETPSKTLAGDSSVLFSPPSIVKHSLLEESTSIISENTPEAYEEIKVQHRCHDPNPNSYYEPVFRNITNESPRTFARLNADRLKTITSANLCDSSLAPKDVLTNALVDHIYKVTNQKPSTSRIDEILTNQKPSTSHINDMSTNKISKPNDTEKENQWWQVEKDSGIDDLYNDYYIFTNNI-