Monarch geneset OGS2.0

DPOGS211367
TranscriptDPOGS211367-TA1563 bp
ProteinDPOGS211367-PA520 aa
Genomic positionDPSCF300173 + 773608-778350
RNAseq coverage322x (Rank: top 35%)
Annotation
HeliconiusHMEL0027920.071.81% 
BombyxBGIBMGA008364-TA0.079.96% 
DrosophilaCoRest-PG1e-8258.15% 
EBI UniRef50UniRef50_D2A0644e-15261.40%Putative uncharacterized protein GLEAN_07336 n=2 Tax=Tribolium castaneum RepID=D2A064_TRICA
NCBI RefSeqXP_392644.27e-13457.40%PREDICTED: similar to REST corepressor 3 [Apis mellifera]
NCBI nr blastpgi|2700052921e-15161.40%hypothetical protein TcasGA2_TC007336 [Tribolium castaneum]
NCBI nr blastxgi|2700052924e-14958.87%hypothetical protein TcasGA2_TC007336 [Tribolium castaneum]
Group
Gene OntologyGO:00055152.4e-13protein binding
GO:00036779.9e-08DNA binding
KEGG pathwaydre:5692944e-92 
 K11829 (RCOR1, COREST)maps-> Huntington's disease
InterPro domain[114-181] IPR0090572.4e-13Homeodomain-like
[44-97] IPR0009491.2e-11ELM2 domain
[129-177] IPR0010059.9e-08SANT domain, DNA binding
[390-434] IPR0147781.1e-07Myb, DNA-binding
Orthology groupMCL11584 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211367-TA
ATGGTCTTGGCTGAAAGAAATAATGATGTTCGAAACGGCAAGCGTTCGAGAGGACCAAGCCCGAATGGCCATGGTAGCCCTGATTCTAGCTCCGAGGATGAGAATGTAGTGCCATTTGCAGCTGAAAAAATAAGGGTTGGTCGAGATTATCAGGCTGTGTGTCCAGAGTTGGAACCTCTGGAGCAAAGAAGACCTGATCAGATTTCGGATAGAGCTCTTTTAGTTTGGTCACCTACATGTGACATATCTGATACCAAATTGGATGAGTACATAACAACAGCTAAGGAAAAATATGGATACAATGGTGAACAAGCTCTTGGAATGTTGTTCTGGCACAAACATGATCTCAATAGGGCTTCAATGGACCTTGCAAACTTTACACCTTTTCCAGATGAATGGACAGTGGAAGACAAGGTGTTATTTGAGCAGGCTTTCCAGTTTCATGGCAAGAGCTTTCATAGAATAAGGCAAATGTTGCCAGATAAATCAATTGCATCATTAGTGAAGTATTACTACTCATGGAAAAAGACAAGAGCTCGTACATCGTTGATGGATGTTGTGAGTGAGGGCCGTAATGCTGCAGGATCAGGAAGTGGCAAAAGGGATTCCGGTGCTGGTTCTGAACCTGGCGGTTCAGATAAAGATTCTGACAATGATGAAAAGAAGTGGACGTTACACCGTGGTGTAGTACGTGGTGGGAACTGTGGGATCTCGCGCGCCGGTTTAGAGGGTGACGGAGGCGATGGGGGCAAGTGGTGCACCGTGTGCGGCATTCTGTGCTCCCAGACCACGCCGCACAACTCGCACAAGTTGTGTCAGGCATGCCTCGTGCACGCCAGACGGACGGGCAGTATGCGACCTCTGTGCGGGCCTTCTGGGAGACGAGGTGCGGGCAAACAGCAGCGTTACAAGCATCGCCTGCCTCGTGGCATTTACATCAACCACGACGATCTGGTTGCCATGGCGACGGGGCCGCAACCCGGCGACCGCAACCACAACCAGAACCAGAACCAGGGCGAGGCTGTGCTCAGAGCAATGGACAGAGAGATTATATCGCTGAAGAGACAAGTTCAACAAAACAAGCAGCAATTAAGTGCATTGAAGCGTAAAGTTGGTGATACCGGAGTAGAGGAGCTTAGACCTGGCGAACCACCCGCCAAGATAAACTCCAGGTGGACCAACGACGAACTACTAATGGCAGTGACGGCTGTTAGGAAGTATGGCAAGGATTTCCAAGCGATTGCAGAAACACTCGGAACAAAGACGGAATCCCACATTCGTACATTTTTCATTTCTTACCGCCGTCGATACAACTTGGATGCCGTACTCAGAGAGCATGAGGCAGACAGACAAAATGAAAATCATATACAACCAGGCACAACAAGCACAGAGTCGAATGAAAATGATGTTGACAATAGTAATGCCAACAATGGAACCGGATCACCACAGACTAACGCTAAGGATGAAAAGACAGAGGTAGATAGTGAGGGAGTAGCAATCGGAGCGTCAGCTGAACAAGGCGGGCCGCCCACCACGCCGCCAAAACACAAGCATGCTAAGTGA

Protein sequence:

>DPOGS211367-PA
MVLAERNNDVRNGKRSRGPSPNGHGSPDSSSEDENVVPFAAEKIRVGRDYQAVCPELEPLEQRRPDQISDRALLVWSPTCDISDTKLDEYITTAKEKYGYNGEQALGMLFWHKHDLNRASMDLANFTPFPDEWTVEDKVLFEQAFQFHGKSFHRIRQMLPDKSIASLVKYYYSWKKTRARTSLMDVVSEGRNAAGSGSGKRDSGAGSEPGGSDKDSDNDEKKWTLHRGVVRGGNCGISRAGLEGDGGDGGKWCTVCGILCSQTTPHNSHKLCQACLVHARRTGSMRPLCGPSGRRGAGKQQRYKHRLPRGIYINHDDLVAMATGPQPGDRNHNQNQNQGEAVLRAMDREIISLKRQVQQNKQQLSALKRKVGDTGVEELRPGEPPAKINSRWTNDELLMAVTAVRKYGKDFQAIAETLGTKTESHIRTFFISYRRRYNLDAVLREHEADRQNENHIQPGTTSTESNENDVDNSNANNGTGSPQTNAKDEKTEVDSEGVAIGASAEQGGPPTTPPKHKHAK-