Monarch geneset OGS2.0

DPOGS201152
TranscriptDPOGS201152-TA4011 bp
ProteinDPOGS201152-PA1336 aa
Genomic positionDPSCF300065 - 25436-30907
RNAseq coverage1011x (Rank: top 13%)
Annotation
HeliconiusHMEL0031850.073.02% 
BombyxBGIBMGA003928-TA0.069.55% 
Drosophilawge-PC5e-6260.11% 
EBI UniRef50UniRef50_Q0C7258e-12045.10%Phd finger transcription factor n=1 Tax=Aedes aegypti RepID=Q0C725_AEDAE
NCBI RefSeqXP_001647947.11e-12045.10%phd finger transcription factor [Aedes aegypti]
NCBI nr blastpgi|1571033633e-11945.10%phd finger transcription factor [Aedes aegypti]
NCBI nr blastxgi|1571033639e-16833.13%phd finger transcription factor [Aedes aegypti]
Group
Gene OntologyGO:00036771.2e-11DNA binding
KEGG pathway 
InterPro domain[1207-1324] IPR0010251.2e-11Bromo adjacent homology (BAH) domain
Orthology groupMCL21924 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201152-TA
ATGTGCGGGACACGGCGTGTTGTGGAGTTCAAACAGGAAGCGCAAGTGTCGATGGAGAGTATGATGGTGGGCGGCATGGGTGGCGTGGGCGTTGGGTCCGTGGGCAGTGGCATGTCGTCCGTGGGCGGTTTTCAGCTGGTCCGAGAGCCGTCCTCGGGAGCTCTGCTGCTCCTACCCGCTCCCCCCGACCTCCCTCACGCCGTGGTGTGGGGCGGAGTGCCTTATCCGTCGACACCCTTGCTGCTGCCGCCCGCCCCGCATCCGTCTCACCACCTCCAGCTGCTCCCGGGCGATCTGCTGGCGTCTACTGCTACGCTACAACACACCCACACTCATTCCACGCGGCTGGTCACCCTCGCCCCCGCTCCCCCCGCACAGCCTCACCCTCACCCGGTCGATAAGAGGAAACCCCTCATGCAGCCGATAATAACCCCTCACACGCTCATCAAAATAGAGCCGGAGCCGCCGCAGGAGAAGCCCCAGGCCTTCGCTCCCGAACCCGTCCAGCAGCCGATACTCACCACGCATCTCTACTACCAGCCGGATTTCCAAGAGCAGGCGTGCAGGCCTCAGGGTCCCCCGGCGGCCCCTCAGACTCCGCCGCCCGAAGTCCCCGCACACAAGGACGCCTCTAACCAGACAGATCACATTGACGAGGACGACGACTCTCCTATCAACGCGGACGAGGACGAGCGTGAGGTCGCGTGTGTCGGGGTCGCGCACTACCCGGAGGGCTCGAACATTATACAGATACAACCATTGCATCAGTCCATAGACGGCGCGGAGGAAACCACTCTGGAGGGACTCGTGGCGGCCACCGTGGTGGGGGCGGTGGAGAAGGCGATCAATGCCATGGAGAGGGAAGAGGCCGCCGATAACTCCAGCAGCAGAGGCTCCCACGACCTGGTCATCGACACAGCCGGCAGCATGACGCCCCTGCAGGCGCAGCAACGCCCGCCCGTCGATGTCAGCGGCCTGGAACTCCTCTCGAACAGTATCGAGCAGTTTGAGCGGACGACTCCCAGCAACCACGCGGCCGGCTCCGACCAGGCGCCGCTTACGATCGACACGAGGCCGTCTACTAAAATAAACATACTCATAAAAACCTCACCGAGGTCTCCGTCGCAAGACGATGATGTAGTAGAGACACATAAGATAAGGTTCCAGTTCCCTCTAGTGGAGACAGGTGACAGCGACAAGCCGTCCCTGGATGGGTTGGGGTTGCTGTGTGCTTTGGCGGAGCAGAGATTCATGGAGGAGGTTGAGGAGAGCCCCTCCCCAAATATGCCCTCGACCTCGCGAACTTTTATTAAGACGGAGCTGCTGAGTCCCACAGAAAGGGAAAGGGAGAGGGAGATATCGTCCGAATTGAAAAAAGAAAGACACAGACACAGAGATGACGCTTCGGGCGAGAGAAGGAGAAAGAGAAGTACAGATAAAGAAGAGAGATTACGACACAAGTTGGAAAAGATGCGTCGACACAAAAGGGAGAGAAAGGACACCGACAAATCTGAAGGTGGCGAGTTGGAGGCGAGTTTACGGCGAGTGACGGCATGTTCCTGTGGGATGGTGAATTGTTCCCACACGTCGAGCGTCCCGTCCGCGCAGGCGCTGGTGAACGCCATGGAGAAAGATATGAGGGAACGTCTACAAGACCTGCAGCGGCAGTGTGACGAGAAGCGCGCTCAGCTGGACGCCCTTACTCCGCCACTGCCGGCGCTCGTCACGCCCGCACCGTGCCTACAGCTCAGTGCGAGTCCAGCTCTCTCACCGGACTCCGATAGAGGCTCATCCAAGAAACGTAAAGTGGGAAGACCGAGGAAAGTCTCCAGTCCCGACTCCACGGAAACCATCGTAGCCAAGAAACCGAAGTCGAAAAACACTCTCGTCGGTTACTTACTGGCGAAAGGAAAACTTAAAGGGAACATATTGTACTCAAAGGGCGAACCCTCGAGAGACGACGGGAGTAGAACTAGTAAGGTCCGGCCGAAACTAAAAGCGGAGCCTGTCGTGAAGATGTACTCCGAGGAAGACGAAAACGATTGGGGGCTCAACAGATCTGCGAGCTCCTCCATGGAAAGTCTCAACGAGGTGAGGCAGAAGAACAGGGAGAAGGTAGACCGGTTAGCAAGGAAGCATTCGAGAGACGACATGTCAGAAATAGAGCTAGCCCTGCGGAGAGCAAGCGCCTCCAGTGATAGTGACAAGGAAAGAAGACGTGCAAGAAAGAAGCGGAAGAGTACTAAGTCGAAAGAGAGAGCAGAGGCGAACGCGGAGCAAACACAGCACGAACCAGGGACTCAGACGAAGATATCGAAATGTACTTTGACAGAAGAAAAAATAGACACATCGCCGAGAGTGCTCACGGCGAGAGGAGGACTCTTCTACGCGGGGAAGTTGAGCGCTGTACAGGCCCCTGACGTGTACGCAATAACATTAGACGGAGAGAGAGGAAATAAACCGCACATACTGTCGAGGGAGGAAACATTGAGAGATGCCACAGTTCTCTGTAAACCATATCCTCTACTCTCCCAGATCCTGGAGGTGAGTCCGTCTAGTGTCCGGGAGCTTCCATCCGGTACCCGTGTATGTGCGTACTGGTCCCAGCAGTACCGCTGTTTGTATCCGGGAACAGTGGCCGTGTCTTCCCCTGACCCACACCATGACAAATTTGTCGCGGTGGAATTCGATGACGGTGACTCCGGTAGGATAGCCATCGAAGATATCAGATTCTTGGAACCCAATTATCCTATCGTGGAATACGAAAACACATTGTTTACTCTGAGCAAACGGCGGCGTAATACGAGCGTGACGGAAGATAAAAAACACTCGACGGCTTCCACGAGCAATGACGTCAAGAACGAAGCACAGAACGATGGACAGAAGGAGGAAGAAGATCGACACAGAGATAGGAAGAAGTTAAAGAAACATCGCAAAGAAAAGATGAGGCGACTGACGAGCGAGGACGGACCCGGCTCAGAATATATAAAGAAGAAGAAAAAGAAACACAAGTGCTGCGAGGAACACGGGAAGCATCGCAGGCATCACAAGAAACATCACCGGAAACACAAGAAGAGACATCACTCAATATGCAAGGAACATTCCAGTTCCTCGGGCGACGATCACAGACAGAAATCATCCTCGGACTACATGGACTCCAACAAGTCCAACGAGGACTCGCTCGACTCCAACGACCGCCTCTCCACTCTCATAGCGGTCGAGAAGTCTCCGGAAGAAAACATGAAAACTGTCATCAAGAAGGCCGTGCTGTCCAAGAAGAGTCTAGTGAAGAATGCGGTGCTGGATTTCAGTAATTTGAATAAAATAACACTCAAGAAGGAAGATGCGAAGGATAACCTGAAGGATAGCGGGATCGGCCTGGAGGAGCCGCTCCCAGAGACCGCGGCAGCGTCCACGTCCACCGATACTTCTAAAAAGAAGGCGAAGAAGCGCACGGTGTCCTCTACATCCTCGGACGGCGGCGGCGGTGTCAGCAAGATGGCGGCCTTCCTCCCGGGGGGAGCGTTGTGGAGGTGGCACGGGCCCGCCTACAGGAGGACCACCAGGCCTCGGCACAGGAAACTATTCTACAAGGCCATACAACGCGGGGAGGAAATACTACATGTGGGTGAGGCGGCGGTGTTCCTGTCCACCGGCCGCGCCGACCGCCCCTACATAGGACGGATCGCGGCCCTTTGGCAGGCCCGAGGTGCCATGGCGGTCAGGGTACACTGGTTCTACCACCCTGAGGAGACGGCCGGCTGCCGAGACTTGAAGTACCCGGGCGGGCTGTTCGAGTCCCCGCACACCGACGAGAACGACGTCCAGACGATATCCCACAAGTGTGAGGTCCTGCCCCTGGCACAGTACCAGGAGCGGCTGGGGGACGACCCGGCCCGGTACAGCACCGTGTACGACAACAACGACGTGTACTACCTCGCGGGTCACTACGACCCCACCCAGCAGGCCCTCACCATGGAGCCGCACATACCGCTGCAGGACAACTCCTAG

Protein sequence:

>DPOGS201152-PA
MCGTRRVVEFKQEAQVSMESMMVGGMGGVGVGSVGSGMSSVGGFQLVREPSSGALLLLPAPPDLPHAVVWGGVPYPSTPLLLPPAPHPSHHLQLLPGDLLASTATLQHTHTHSTRLVTLAPAPPAQPHPHPVDKRKPLMQPIITPHTLIKIEPEPPQEKPQAFAPEPVQQPILTTHLYYQPDFQEQACRPQGPPAAPQTPPPEVPAHKDASNQTDHIDEDDDSPINADEDEREVACVGVAHYPEGSNIIQIQPLHQSIDGAEETTLEGLVAATVVGAVEKAINAMEREEAADNSSSRGSHDLVIDTAGSMTPLQAQQRPPVDVSGLELLSNSIEQFERTTPSNHAAGSDQAPLTIDTRPSTKINILIKTSPRSPSQDDDVVETHKIRFQFPLVETGDSDKPSLDGLGLLCALAEQRFMEEVEESPSPNMPSTSRTFIKTELLSPTEREREREISSELKKERHRHRDDASGERRRKRSTDKEERLRHKLEKMRRHKRERKDTDKSEGGELEASLRRVTACSCGMVNCSHTSSVPSAQALVNAMEKDMRERLQDLQRQCDEKRAQLDALTPPLPALVTPAPCLQLSASPALSPDSDRGSSKKRKVGRPRKVSSPDSTETIVAKKPKSKNTLVGYLLAKGKLKGNILYSKGEPSRDDGSRTSKVRPKLKAEPVVKMYSEEDENDWGLNRSASSSMESLNEVRQKNREKVDRLARKHSRDDMSEIELALRRASASSDSDKERRRARKKRKSTKSKERAEANAEQTQHEPGTQTKISKCTLTEEKIDTSPRVLTARGGLFYAGKLSAVQAPDVYAITLDGERGNKPHILSREETLRDATVLCKPYPLLSQILEVSPSSVRELPSGTRVCAYWSQQYRCLYPGTVAVSSPDPHHDKFVAVEFDDGDSGRIAIEDIRFLEPNYPIVEYENTLFTLSKRRRNTSVTEDKKHSTASTSNDVKNEAQNDGQKEEEDRHRDRKKLKKHRKEKMRRLTSEDGPGSEYIKKKKKKHKCCEEHGKHRRHHKKHHRKHKKRHHSICKEHSSSSGDDHRQKSSSDYMDSNKSNEDSLDSNDRLSTLIAVEKSPEENMKTVIKKAVLSKKSLVKNAVLDFSNLNKITLKKEDAKDNLKDSGIGLEEPLPETAAASTSTDTSKKKAKKRTVSSTSSDGGGGVSKMAAFLPGGALWRWHGPAYRRTTRPRHRKLFYKAIQRGEEILHVGEAAVFLSTGRADRPYIGRIAALWQARGAMAVRVHWFYHPEETAGCRDLKYPGGLFESPHTDENDVQTISHKCEVLPLAQYQERLGDDPARYSTVYDNNDVYYLAGHYDPTQQALTMEPHIPLQDNS-