Monarch geneset OGS2.0

DPOGS212725
Transcript        DPOGS212725-TA   1401 bp
Protein           DPOGS212725-PA   466 aa
Genomic position  DPSCF300012 - 325195-332518
RNAseq coverage   909x (Rank: top 14%)
Annotation
Heliconius       HMEL003564         2e-69    57.76%
Bombyx           BGIBMGA013173-TA   4e-67    54.65%
Drosophila       Mnt-PD             9e-21    55.06%
EBI UniRef50     UniRef50_Q175Q6    4e-33    68.27%   Max binding protein, mnt n=2 Tax=Coelomata RepID=Q175Q6_AEDAE
NCBI RefSeq      XP_001652093.1     8e-34    68.27%   max binding protein, mnt [Aedes aegypti]
NCBI nr blastp   gi|157113769       2e-32    68.27%   max binding protein, mnt [Aedes aegypti]
NCBI nr blastx   gi|170059937       4e-34    33.33%   max binding protein [Culex quinquefasciatus]
Group
Gene Ontology    GO:0005634         4.9e-18  nucleus
                 GO:0006355         4.9e-18  regulation of transcription, DNA-dependent
KEGG pathway
InterPro domain  [200-287] IPR011598  4.9e-18  Helix-loop-helix DNA-binding
                 [208-260] IPR001092  7.9e-14  Helix-loop-helix DNA-binding domain
Orthology group  MCL26040           Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212725-TA
ATGAGTGTGCGCCTCTTACCCCGCTCCCTTCCCCGCTTCAAGCCACTCGCCCTCCGCGGCGTCCGTCGTCAGTCGGTCGCGGAGCGCGGTCCATGTCGGTCGGTTGTACGCCGCCTGTCAGTGAACAGCCCTTGTGTCCGCGGCTCGTGTTCGCGGGAGCCCCGCCGGCTGCGTGTATTAATAGCGGTCGTTATCACGGCGACCTCGAGGACGAGCCGAGACGCCGCGAGGTCGCCGCCACCTGCGCGCCTCTCGCCGTCCCCTCCCCACCCCTCTTCGTCCTCGTCCTCTGATCTGTTCATCGTCAGCTCGGTTTTCTTGCAACGAGGAGGAGCCCGAGACGCCGTTGTCGTTGAGACTCACCGGGGTATCCGTGAGGCCCTTGACGCCGCCTGCCCCCGCCCCCGCTCCCGTACGGCAGCCGCCGCCCCCGCGCCCCTCGCCTTCCCGCCTCACCCGGCGCCCCTCGACTTCGTACAGGAACATTCCAGGCCGCGACACACGAACGGTCACGGACCGCCGCAGATCCCTTCCCCTCCGTCAGTGAGCAATACCGCCGCCGGCGGGGGCGGGGGCGGGGGCGGGAGCAGTGTCCCTCGCGCCGGCACCAGGGAGGTCCACAACAAGCTCGAGAAGAACCGTCGAGCTCATCTCAAAGAGTGTTTTGAGCTGCTGAAGAGACAGCTGCCGGCCGCCGGCGACGACAAGAAAACATCCAACCTCTCCATCCTGGGCTCGGCCATCAGATACATACAGGTGCTGCGGCGGAAGGAGCGCGAATGTGAGCACGAGATGGAGAGGCTGGCGCGGGAGAAGATCGCCGCCCAGCAACGACTGGCCGCGCTGAGGAGAGACGCCGCGGTCCGGGGGCGGCGGCTCGCGGCCCCGGACGAGGAGCTCGTGAACGGACAGTTGCTGGGGATGCCACTGAACGTGAGTCCTCTCGTACTGCTGATGCGAACGTCCACTGGCGGCGATCACTGGCGAATGCACGGAGACCCTCATCCGAAGTCGTCTCATCCTCGTATCCGTATGTTCCAGGCGGCGCCACCGCCCCCGCCCGCCGCCCCGGCCTCCTCGCGTGCTGAGTCTCCGCTGCGCTCCCTGAACCTGAGCACCAAACTGCGTCCGCTGCCCATACAGATCACACCGACCACCTCCGCACAAGTAAGTGTGGAGGAGCCTGGTGTTACGTCCACACTGAACAGTTTGGTGCGACCAGCTCAGATACACCTGCCGCTGTCACAGGTGGTGGCGGGCGGAGGGCTGGTGGTAGGCCCCGCGTTGCAGTTACTGCCGGCCGGTCTACGTGTGCTGCATAATGGTAACGATCACTCCACTCTCCACATCATACTGTCTTTATCAACACCGGGGCAACCCGTTTCTTCCCACAACATTTAA

Protein sequence:

>DPOGS212725-PA
MSVRLLPRSLPRFKPLALRGVRRQSVAERGPCRSVVRRLSVNSPCVRGSCSREPRRLRVLIAVVITATSRTSRDAARSPPPARLSPSPPHPSSSSSSDLFIVSSVFLQRGGARDAVVVETHRGIREALDAACPRPRSRTAAAAPAPLAFPPHPAPLDFVQEHSRPRHTNGHGPPQIPSPPSVSNTAAGGGGGGGGSSVPRAGTREVHNKLEKNRRAHLKECFELLKRQLPAAGDDKKTSNLSILGSAIRYIQVLRRKERECEHEMERLAREKIAAQQRLAALRRDAAVRGRRLAAPDEELVNGQLLGMPLNVSPLVLLMRTSTGGDHWRMHGDPHPKSSHPRIRMFQAAPPPPPAAPASSRAESPLRSLNLSTKLRPLPIQITPTTSAQVSVEEPGVTSTLNSLVRPAQIHLPLSQVVAGGGLVVGPALQLLPAGLRVLHNGNDHSTLHIILSLSTPGQPVSSHNI-
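
The lengths reported in this record can be cross-checked with a small sketch. It assumes that the transcript contains only coding sequence (no UTRs) and that the trailing "-" in the protein sequence marks the stop codon; neither is stated in the record. The function names are hypothetical and exist only for this illustration.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues.

    Assumption: one codon (3 nt) per residue, plus one stop codon if requested.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def residue_count(protein_seq: str) -> int:
    """Count residues, ignoring whitespace and a trailing '-' or '*' stop marker.

    Assumption: '-' or '*' at the end of the sequence denotes the stop codon.
    """
    seq = "".join(protein_seq.split())
    return len(seq.rstrip("-*"))


# Values copied from the record above.
TRANSCRIPT_BP = 1401
PROTEIN_AA = 466

# Does the transcript length equal the coding length implied by the protein?
lengths_agree = expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP
print(lengths_agree)
```

Passing the full DPOGS212725-PA sequence to residue_count and comparing the result with the listed 466 aa would be a further check on the record.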