Monarch geneset OGS2.0

DPOGS208021
TranscriptDPOGS208021-TA2244 bp
ProteinDPOGS208021-PA747 aa
Genomic positionDPSCF300203 - 286719-289853
RNAseq coverage242x (Rank: top 43%)
Annotation
HeliconiusHMEL0178070.079.00% 
BombyxBGIBMGA001494-TA0.066.67% 
Drosophilaear-PA2e-5340.66% 
EBI UniRef50UniRef50_E0VMA61e-5746.83%Neurofilament triplet M protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VMA6_PEDHC
NCBI RefSeqXP_966967.27e-6448.86%PREDICTED: similar to GA21032-PA [Tribolium castaneum]
NCBI nr blastpgi|2700009164e-6348.86%hypothetical protein TcasGA2_TC011185 [Tribolium castaneum]
NCBI nr blastxgi|1700347392e-10935.20%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00056349e-83nucleus
GO:00063559e-83regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[4-748] IPR0050339e-83YEATS
Orthology groupMCL22647 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208021-TA
ATGACAAATATAAAGGTTAATTTCGAGATCGGTCATGAGGCTTCTTTAAAATCTAAGAAAACTCCCGAAGGGTTCACTCATGACTGGGAAGTCTTCGTCCGTGGCCAGGAAGGTGCTGATATAAGTCATTTTGTTGAGAAAGTAGTTTTCTATCTTCATGAAACTTTCCCGAAACCAAAGCGAGTTGTGAAGGAGCCACCATTTTCCATAAAGGAGTCAGGCTATGCTGGTTTCGTGTTTCCAATAGAAATATACTTAAAAAGCAAGGATGAACCCAAGAAAATTCAATTCACATATGATTTAACTTTACAGCAATGTGGATTTTTAAAAGATAGGTATGTGTTTCAAAATCCAAGTGAGGAATTCAGAAGAAAACTTTTAAAAGGTGGAGGGATTCCCGTAAGTAACAGTTCTTTTTATACAAACCCCGAACAGGAAAGTAGAAGTCGAGATTCATTCACCGATGAGAAACCACAACTTGTTAGCAAACCAAAATTATCTTCAGATAATATAAAGAAACATAAAGTAAAAGAGTACAAAGATGAACAGCCGCATAAAAACATTTGTTTTGAAAATTTGTTTGGACCACCTATACAGAAACCACCGAAAGTTTCCCCAGATCCCAAGAAACTGGAGAAAAGTTCTCTATCTGCTAAGTCTGATAAGAAAGACAAAGATAGGTCTGGCTCAGATAAGAAATCGAAACATGATCACAAAGAAAGCAAACAGGATAAGGTTAAAATTAAAGAAGAGAAGAGCAAGCAGAAGGGAGAAAAAGTAAAGAATCACAATAAAGAACAGGATAGGGTTAAAGAAAAAACAGCTAAACGGCAAAATGAGAGACCCCCTTCCCCTGAACCAGCCAAGAAAAGATGTCCAAGTCCTAATAGAAAGCTGCCAAGTCCAATGCCTAGATCAAGTAGTGCCTCTAGCATAAAAGAAGAATACAAACCAAAACATAATTCTGAAAATTTTGAACACAGAAAATCTAAACTTGATGACAGAATACCGGATATAAAAGTAGAGAAAGATGTAAAGGAGAAAAAGAAGAAAGAGAAAAAGAGTCATGATAGAGATAAAGAAAGAAAAGAAAAGAAGGAGCACAAAAAAGATAGTCATAAGTCAAAAGAAGATAAGGAACCAATAAAGGAAATACCTAAAGAAATAGTCAAATCAAGAGAAGTTGTCAAGGAGAAAGAAGTTATAAAAGATTCTCCTGTGAAAGAAAAACAAATAAAACCTGAAAAGACAGTGAATAAGTTTTCTATAGAAAATTTAAGGAAGACACCTCCACCAGAAAATGTTGACAGGCATGATAATCACAAATCAAAGGACAAAGGAGACTCTGAGAGAAAACATAAACACAAGAAAAAAGATAAAAAGAGAGACGAGTCGAAAGAAAAGCACAAAGAATCTAGCAAAGAGAAAAGGCATAAACATGAAAAAGTGCGGGAAATACCTCAAGAGAAACCTGAGGTTATTGAACTTAGAGAAACACCAATTCCAAAGGAACGTCCAATGCCTGAACCAGCCTCACCTATATCTATAGACACGGCATCTCAATGTAGTTCTAAGAGTGGTATAAATAAACCTATACATATAGTGGACGATGCGAACAGCAGTCATTCGGACTCGGAAGGATCAATAATAGCTGATGAAGAAGATGTTAAAGTTAAAATCGAAAACCATTCTCCGGAACCTATTAAAAGAGAACCTTCTCCGGAACCAGAACCCGAACCTGAACCGGAACCGGAAATTGAACCTGAACCGGAGCCAGTAGTGGAACTTCCGCCAGTCCAGAAAGAAAAGTCTAAAAAACATAAAGACAAATCAAAAAAAGAAGAGAAGAGAAGAAAGAGAAAAGCAGCTGAGGAGGAAGACGCTGAAAGTAGAAGAGTTGCTAAAGCTGCGGCGACTGCTGACTCAGGACCTTCGAATAATGAAAATGATCATGGAGAAAGCAGTGGTTCAACGTCCATGGAAACCAAAGTTCAGGATAATGGTGTATCAAGTAGCTTAGGAGAAGACGCAGAACCTGGGGATCTCTCACCGGACTACATGGTACAACTCAGAGGTCTCCAGCAGAGAATTATGATGATAAAGAACAACGAAGATCTGGAAAGGGTTGTGAATCTTATTGCGGAGACTGGGCGGTATGAAGTAACTACACAGACGTTTGACTTTGATCTGTGTTTGTTAGATCGATCAACGGTTCAGCAACTGATACAACTCGTGGGTTGCTAG

Protein sequence:

>DPOGS208021-PA
MTNIKVNFEIGHEASLKSKKTPEGFTHDWEVFVRGQEGADISHFVEKVVFYLHETFPKPKRVVKEPPFSIKESGYAGFVFPIEIYLKSKDEPKKIQFTYDLTLQQCGFLKDRYVFQNPSEEFRRKLLKGGGIPVSNSSFYTNPEQESRSRDSFTDEKPQLVSKPKLSSDNIKKHKVKEYKDEQPHKNICFENLFGPPIQKPPKVSPDPKKLEKSSLSAKSDKKDKDRSGSDKKSKHDHKESKQDKVKIKEEKSKQKGEKVKNHNKEQDRVKEKTAKRQNERPPSPEPAKKRCPSPNRKLPSPMPRSSSASSIKEEYKPKHNSENFEHRKSKLDDRIPDIKVEKDVKEKKKKEKKSHDRDKERKEKKEHKKDSHKSKEDKEPIKEIPKEIVKSREVVKEKEVIKDSPVKEKQIKPEKTVNKFSIENLRKTPPPENVDRHDNHKSKDKGDSERKHKHKKKDKKRDESKEKHKESSKEKRHKHEKVREIPQEKPEVIELRETPIPKERPMPEPASPISIDTASQCSSKSGINKPIHIVDDANSSHSDSEGSIIADEEDVKVKIENHSPEPIKREPSPEPEPEPEPEPEIEPEPEPVVELPPVQKEKSKKHKDKSKKEEKRRKRKAAEEEDAESRRVAKAAATADSGPSNNENDHGESSGSTSMETKVQDNGVSSSLGEDAEPGDLSPDYMVQLRGLQQRIMMIKNNEDLERVVNLIAETGRYEVTTQTFDFDLCLLDRSTVQQLIQLVGC-