Monarch geneset OGS2.0

DPOGS214927
TranscriptDPOGS214927-TA3858 bp
ProteinDPOGS214927-PA1285 aa
Genomic positionDPSCF300163 - 348073-361056
RNAseq coverage1775x (Rank: top 7%)
Annotation
HeliconiusHMEL0161340.059.57% 
BombyxBGIBMGA000204-TA0.073.51% 
DrosophilaSin3A-PG1e-13957.80% 
EBI UniRef50UniRef50_E2C8T80.046.17%Paired amphipathic helix protein Sin3a n=7 Tax=Formicidae RepID=E2C8T8_HARSA
NCBI RefSeqXP_002423856.10.045.57%Paired AMPhipathic helix protein Sin3A, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3071924390.046.17%Paired amphipathic helix protein Sin3a [Harpegnathos saltator]
NCBI nr blastxgi|3287825420.043.51%PREDICTED: paired amphipathic helix protein Sin3b [Apis mellifera]
Group
Gene OntologyGO:00056341.6e-28nucleus
GO:00063551.6e-28regulation of transcription, DNA-dependent
KEGG pathwayphu:Phum_PHUM0892800.0 
 K11644 (SIN3A)maps-> Huntington's disease
InterPro domain[615-715] IPR0131947.6e-60Histone deacetylase interacting
[68-140] IPR0038221.6e-28Paired amphipathic helix
Orthology groupMCL10915 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214927-TA
ATGCTGTCTGTGGGCGGAAACTTCAGCCGTCCACGTGGTAATCCGCCCGTCATCAACTATCTGAAGGCTAACACTGTTCAGTACGCGCCGCCCAAACCACAGGCGCCAAGACTCAAGCTGGTGAGCCAACAGCCAGCCTTAATCCCCGGCCGTCAATCTCCGCTGGGAGGTCCCCTGCCAGCACCTCCAGCTGCTCAGTTCCAGCGGTTGAAGGTGGAGGATGCCTTATCGTACTTGGACCAGGTCAAGTACAAGTTCAACACACAGCCTCAGGTCTACAATGACTTTCTTGATATAATGAAGGAATTCAAGAGTCAGACGATCGACACGCCCGGCGTCATCACCCGGGTATCGAACCTGTTCAAGGGACATCCGGAGCTGATCGTCGGCTTCAACACCTTCCTACCGCCGGGGTACAAGATCGAAGTACAGAGTAACGGACAGGTCTCAGTGTCGATGCCATCGCCCACTGGTGCAGGCTGTCCGCCAGCAGGCGCCGTGGGCGTGGGAGTGGGCGCCAGTGGTGGCGTGGGCGTGGGCGGTGGTGGTAACGCCGTTGGCGTGACCGGTGCCGGTGTCATGATGGGTGTCCATCACCCTCCACCCCAACCACAACTAGTCCATCTGCTGCCTGTACCACAGTCTGTCAGTAACGCGATCGTTCACAACCTGTCTGTGAATGCCACGGCTACCAACACCCTGCATCATATATCACAAGCTCATCAGCAGATCGAGGCTGCAGCCATACACCATCCTCCAGGTTCAGCAGCTAATGCCAGCGCCAGTCACGCTGCTGCAGGTCAACCGGTGGAGTTCAACCACGCTATTGAATACGTCAATAAGATTAAGTCTCGTTTCTCGCGCCAACCGGACAAGTACAAGCGTTTCCTTGAGATATTACACGCGTATCAGAGGGGACACCGGGATCTCAAGGAACCCCACGCCAAGCAGCAGACGGAACAGGAGGTGTACGCACAGGTGGCGAAATTATTTGAGAAGCAAGACGACTTGCTGGCCGAGTTTGGTCAGTTCTTACCTGACGCGAAGGCTGTGACCAAGCCGACGCCCATACCACCTCATTCGAGATCACCACCGCCGCAGGTCCGTATGGAAGCTGAACTAGACCGCTTCCCCAGCTCGACGCCGTCATCCCCTCCACATGGGGCCGTGACACACGTGACGCACGCGACACACGTGACACCAGCCCCTACACCGACACACGCGGCACACGTGACTCACGTGACACACGCTCACGTCCCGCCCCAGCCACATCCAGTACACACACAGCCGCCTGCGCCCAAACACACCAGCGCAGCAGTAGTCACGCCACAACATCACCTCAAACGGTCGCCCAGCTTCACGTCAACAACACAAATAAGTTCAGGTGCGCCAGCAGCTAAGCGAGCCAGGGTCCGTGATCTATCAGTGTCCGAGGCTGGTAAAATGGCCGCCGCCAGTGACTATTCGTTCTTCGACCGTGCTAGGAAGGCCTTAAGATCGCAGCATGTATATGACAACTTCTTAAGATGTCTCCTGTTATTTACAAACGAGATAATATCATCATCTGAGCTGTTATGTGTGACGGCGCCGTTTCTGTGTCGCCACCCTGAGCTACAGAAGTGGCTTCAAGACTTTGTGGGACCTGTGTCACCACCACACACACCCACAAACACACATACAGGTGGATACAACAATAACTTCACAAGTTCAAGCAACCTTCTATACGAACGTACACGTTTAGGGTCAGAGAGTAAAAATAGATATGAACCCTTGGGTCCACTGGGAGCTCAAATGAGACACGAGAGACCTCAAGGAGATGCTGCTATGGATATAGATCTATCGACGTGTAAACGTCTGGGCACCTCGTACTGCGCCCTGCCACGGGAAGCAGCCGCTAGGAAATGTTCAGGACGAACACCGCTCTGCAAGGAGGTACTAAATGACACGTGGGTGTCGTTCCCTACGTGGAGTGAGGATTCTACTTTCGTGACATCACGGAAGACTCAATATGAAGAATATATATACCGCTGTGAGGACGAGAGATTTGAGCTGGACGTGGTCATAGAGACGAACGCAGCAACGATCAGAGTATTGGAAGGAGCTCAGAAGAAGTTGTCTCGAATGAGTCCAGAAGACGCTGCCAAGTACCGCCTGGACGACTGTCTCGGCGGACATTCACCCACCATACACCAGAGGGCGCTCAAGAGGATCTACGGCGACAAGGCTGTGGACATTATAGCTGGTTTGAAGAAAAATCCTGTTGTGGCCGTGCCGGTAGTGCTGAGACGGCTTAAAGCCAAGGAGGAAGAGTGGAGGGAGGCACAGAAGGGCTTCAACAAACAATGGCGCGAACAAAACGAGAAATATTATCTCAAGTCCTTAGATCACCAGGGGATAACGTTTAAGCAGAACGATTTGAAAGCTCTGCGCTCCAAGTGCTTGTTTAACGAGGTCGAAAGTGCGTTCGCTGTGAGGAAACCCGGGCCGCATCTGTTGAGCGACTACGGCACCAAGTCGAGACATGAAGCTATCAAGATAGCCCGTGATGCTGCGGAATTGCTGATACATCACGCTCGCCGACAGACCGGCATCCAGAAGGCCGAGAAGAGGCGCATCAAACTCCTGCTGAGGCAGTTCCTGCCGGATCTCATGGCGCATCCGCGACTTCCGCTCAGCGACGACGAGCATGAAGAAGAAAAGGAAGAACCAACGAGTCCCGGGAGCCCAGCGGCTGATCAGGCTGACGAGAAGACGACAGTCAAAAATGAAAAGCAAGAGTCCTCGGAATCCGATAACGCGTCTGATAAAAGTAGCCGCAACAACAAAAATGATAAAGAAAACAAACCAAAAAATAACAACACGGAAACTAAAGACAGCACAAATTCAATAAAACGAAGCATAAGCAATGAGGAGGCCAGCATCAAAATGGAGCTAAAGAATGAAAACGATCTCGAAGACGACTACAGAGATCACCCGCCTAACGAGTCCCGTTTCGTGTGTACATCATCGTGGTACCTGTTCCTACGGTTGCATGGTGTGTTATGCGCGAGGCTGTGCGGTGCTAGACAGGCGGCACAGCAGCTGGCTGCGGGTGAAGCAGCACGAGCGTCCACACGACCACCCAGCGTGGCGGCCGCGCTGCGACTGAAGCCGACAAATCTACCAGCGGATGCGTCATCTCCGTCGGAGTACTACAACGCCTTACTGGAGCTGGTTAAAGGTGTCTTAGATGGGAATGTGGAATCGTCGGCATACGAGGACGCGGCCAGGGAGATGCTTGGTATCAAGGCTTACCCGCTATACACACTAGATAAGGTCGTCTCTATCGCTGTCAGACAGCTCCAACATTGTGTGTCTGAGAGTTGGTCGGTTCGTGCGACCGAGTTGGCATCCCGCGGTCCCCGAGGCCCGCCTTACATCAGACGAGCGTTGAGAGCTCTCAGACCACACCATACCGCCTTCCTTGTTACATTTTACTTCGGCGACACATGCAAGGTTGGCTTCGAGCTGATGGAAGCGGCTGGCGAAGGTCGCGCCTCTCCTCACAGAGACCAACGCTTGTCACCAACACAGTCCCGTCGAGACACGAACGGCGATGCGTCCGTCCGTCACGGTGGCTGGTCGCCGTATAGTTACGCGCCGATCGCTACCAACAAGCCGGTGTTCCTGAGGCGGAACGCAAGGCGATCGGGCGCGGGGGCGACGGGGGCGCACACCGCTAGCGCTCCGCCGGACATCAGCGAAGCCCCCGCCCACGCAAGGAGAAGAAAACCGCCCAGGTTACACGACCACGACTGCGGAGTCACCAGCGGAGCGAGGTCACTATCACATTGCTACTCGCACCGACAATTAATCTAA

Protein sequence:

>DPOGS214927-PA
MLSVGGNFSRPRGNPPVINYLKANTVQYAPPKPQAPRLKLVSQQPALIPGRQSPLGGPLPAPPAAQFQRLKVEDALSYLDQVKYKFNTQPQVYNDFLDIMKEFKSQTIDTPGVITRVSNLFKGHPELIVGFNTFLPPGYKIEVQSNGQVSVSMPSPTGAGCPPAGAVGVGVGASGGVGVGGGGNAVGVTGAGVMMGVHHPPPQPQLVHLLPVPQSVSNAIVHNLSVNATATNTLHHISQAHQQIEAAAIHHPPGSAANASASHAAAGQPVEFNHAIEYVNKIKSRFSRQPDKYKRFLEILHAYQRGHRDLKEPHAKQQTEQEVYAQVAKLFEKQDDLLAEFGQFLPDAKAVTKPTPIPPHSRSPPPQVRMEAELDRFPSSTPSSPPHGAVTHVTHATHVTPAPTPTHAAHVTHVTHAHVPPQPHPVHTQPPAPKHTSAAVVTPQHHLKRSPSFTSTTQISSGAPAAKRARVRDLSVSEAGKMAAASDYSFFDRARKALRSQHVYDNFLRCLLLFTNEIISSSELLCVTAPFLCRHPELQKWLQDFVGPVSPPHTPTNTHTGGYNNNFTSSSNLLYERTRLGSESKNRYEPLGPLGAQMRHERPQGDAAMDIDLSTCKRLGTSYCALPREAAARKCSGRTPLCKEVLNDTWVSFPTWSEDSTFVTSRKTQYEEYIYRCEDERFELDVVIETNAATIRVLEGAQKKLSRMSPEDAAKYRLDDCLGGHSPTIHQRALKRIYGDKAVDIIAGLKKNPVVAVPVVLRRLKAKEEEWREAQKGFNKQWREQNEKYYLKSLDHQGITFKQNDLKALRSKCLFNEVESAFAVRKPGPHLLSDYGTKSRHEAIKIARDAAELLIHHARRQTGIQKAEKRRIKLLLRQFLPDLMAHPRLPLSDDEHEEEKEEPTSPGSPAADQADEKTTVKNEKQESSESDNASDKSSRNNKNDKENKPKNNNTETKDSTNSIKRSISNEEASIKMELKNENDLEDDYRDHPPNESRFVCTSSWYLFLRLHGVLCARLCGARQAAQQLAAGEAARASTRPPSVAAALRLKPTNLPADASSPSEYYNALLELVKGVLDGNVESSAYEDAAREMLGIKAYPLYTLDKVVSIAVRQLQHCVSESWSVRATELASRGPRGPPYIRRALRALRPHHTAFLVTFYFGDTCKVGFELMEAAGEGRASPHRDQRLSPTQSRRDTNGDASVRHGGWSPYSYAPIATNKPVFLRRNARRSGAGATGAHTASAPPDISEAPAHARRRKPPRLHDHDCGVTSGARSLSHCYSHRQLI-