Monarch geneset OGS2.0

DPOGS200133
TranscriptDPOGS200133-TA3426 bp
ProteinDPOGS200133-PA1141 aa
Genomic positionDPSCF300128 - 643522-651933
RNAseq coverage824x (Rank: top 16%)
Annotation
HeliconiusHMEL0094380.093.29% 
BombyxBGIBMGA002779-TA0.085.39% 
Drosophiladre4-PA0.070.92% 
EBI UniRef50UniRef50_Q8IRG60.071.17%FACT complex subunit spt16 n=39 Tax=Eumetazoa RepID=SPT16_DROME
NCBI RefSeqXP_624006.20.077.46%PREDICTED: similar to dre4 CG1828-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3504179440.078.44%PREDICTED: FACT complex subunit spt16-like [Bombus impatiens]
NCBI nr blastxgi|3838552660.075.80%PREDICTED: FACT complex subunit spt16-like [Megachile rotundata]
Group
Gene OntologyGO:00099873.6e-59cellular process
KEGG pathway 
InterPro domain[179-438] IPR0009943.6e-59Peptidase M24, structural domain
[532-692] IPR0139537.1e-57FACT complex subunit Spt16p/Cdc68p
[825-967] IPR0137193.4e-24Domain of unknown function DUF1747, eukaryote
Orthology groupMCL11902 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200133-TA
ATGTCCAATATATCTTTGGATAAGGAAACGTTTTATAGGCGTATGAAAAGGCTATATGCGGCGTGGAAGGCTGCTGCTGCTGATTCTAAGAGTGATGATGTTTTGGCCAAATGTGATTGCTTGGTGTCTTGCGTTGGCGTGGATGAAGATACTCTATACAGCAAGTCAACCGCATTACAGACTTGGCTCTTTGGATATGAACTTCCAGACACCATAACAGTTCTGACTGAACAGAGCATGTGTTTTCTGGCAAGTAAAAAGAAGATTGAATTCCTCCGCCAAATCGAAAATGGTAAAGAAGAAACAGATCTACCTCCAGTAAAACTTTTAATAAGAGATAGAAATGACCATGATAAAGAAAACTTTAATAAGCTTATACAAGAAATAAAGAAATCTAAATCCGGCAAGACTCTGGGTGTGTTTGCCAAAGACAATTATCCAGGGGAGTTCTGCGAGAGTTGGAAATCTGCAATGAAGGCGGAGAAGTTTGAAAATGTGGATATCAGTTCATCCGTAGCTACATTCATGGCACCAAAAGAAGATTCAGAAATAATCACCATCAAAAAGGCCTGCCTTGTCACCGTTGATGTTTTCACAAAGTACTTAAAAGATCAAATTATGGAAATTATTGATTCGGACAAGAAAGTAAAACATTCGAAACTAGCGGAAGGTGTGGAAGCTGCTATATCAGATAAAAAATATGTAACCGGTGTAGACACGAGCCAAGTAGATATGTGCTATCCACCGATCATACAGTCTGGAGGGAATTATAGTCTGAAATTCAGTGCTGTGTCAGATAAGAATCACCTACATTTTGGTGCAATAGTATGTTCTCTAGGAGCCAGATACAAGTCATACTGTTCAAATATTGTCCGCACATTACTTGTCAATCCGACGGACAATGTCCAAAGCAATTATAATTTTCTTTTGAATTTGGAAGAAGAGGTCATGAAGCATCTTGTGTCTGGTGCCAAGCTGTCAGCCGTTTATGAAGCTGGTTTGGCATTGGCAAAGAAAGAAAAACCTGAATTAGTGGACAACCTCACAAAGACGTTTGGATTTGCAATGGGAATAGAATTTCGTGAAAGTGCCATAGTTATTGGACCGAAAACCGCAGTTGTTGCAAAGAAAGGCATGGTCTTTAACATTAATATTGGTTTGGCAAATTTAACCAACTCTGCAGCAACGGATAAAGAAGGAAAGACTTATGCCCTATTCATTGGTGATACTGTGCTCGTGAATGATGAACAGCCAGCATCGCTGCTAACACAATCCAAGAAGAAGATTAAAAACATAGGAATATTCCTTAAAGATGACGATGAAGAGGAAGAGGAGGAGAAAGAGAATAAAACAGAAATTTTGGGTCGCGGTAAAAGGACGGCAGTTATTGAGTCGAAGCTTCGGACTGAACATTCTTCAGAGGACAAACGTAAGGAGCATCAGAGAGAATTGGCGATAGCTCTCAACGAGAAAGCTAAGGAGAGACTGGCGAAACAGTCGAGTGGAAAAGAGGGAGAGAAGATAAGGAAGAGTACAGTCTCGTACAAAAGTGTCAGTCAAATGCCCAGAGAGAACGAAGTTAAAGAGTTGAAATTATACGTCGATCGTAAATATGAAACAGTAATATTGCCGATATTCGGCGTGCCGGTACCATTCCATATATCTACAATTAAAAATATATCTCAGTCTGTGGAGGGCGACTATACATATTTGAGAATCAATTTCTTCCACCCGGGTGCCACTATGGGCAGAAACGAGGGTGGCAACTACGCGCAGCCTGACGCGACCTTCGTTAAAGAAGTTACATACCGCAGTACAAACACTAAAGAGCCAGGAGAAATTTCACCTCCATCATCAAACCTAAACACTGGATTCCGGTTAATAAAGGAAGTTCAGAAGAAGTTCAAAACGCGAGAGGCGGAGGAGAGGGAGAAGGAGGACTTAGTTAAACAAGATACTCTCGTTTTATCCCAGAACAAAGGAAATCCCAAACTGAAGGATTTATACATCAGACCTAATATAGTCACAAAGAGAATGAGCGGGTCTCTAGAAGCGCATTCGAACGGTTTCAGATTCACGTCAGTGAGAGGAGACAAAGTTGATATTTTATATAACAACATCAAAAACGCATTCTTCCAACCGTGCGATGGAGAGATGATCATTCTATTGCATTTCCATCTGAAGCACGCTATTATGTTCGGGAAGAAGAAACATGTCGACGTGCAGTTCTATACCGAGGTCGGTGAGATTACTACAGACCTGGGCAAACACCAGCATATGCACGACCGTGACGACCTCGCCGCCGAGCAGAGCGAACGGGAACTGAGACACAAACTGAAGATAGCTTTCAAAAGTTTCTGCGAGCGCGTCGAGAACATGACCAAACAGGAAGTCGAGTTTGACACGCCGTACAGAGAACTCGGTTTCCCCGGAGCGCCGTTCCGTAGTACTGTCCTCCTACAACCGACCTCTGGAGCCCTCGTCAACCTGACCGAGTGGCCGCCCTTCGTCATCTCGCTGGAGGACGTTGAACTCGTTCACTTCGAAAGAGTACAGTTCCACCTCAAGAACTTCGATATGGTTTTCGTGTTTAAGGATTACGCCAAGAAAGTCGCCATGGTCAATGCTGTCCCCATGAACATGCTCGATCACGTCAAGGAGTGGCTGAACTCGTGCGATATCCGGTATTCGGAAGGTATCCAGTCTCTCAACTGGACAAAAGTCATGAAAACCATTACTGATGATATCGAAGGTTTCTTCGACAACGGCGGCTGGTCTTTCCTGGACCCCGAGTCTGATGCCGAGAACGAGGAACAGCACGACGATGAATCTGAAGAGGAGGATGATGCGTATGAACCGACGGATGCTGAGTCGGAAGAGGAATCCGAAGATGACTCGGAGTACGACTCCGAGGCTTCGGAAATGTCCGACGACTCCGGCGACAGCGACGGTGGTGAAGAGGACGAAGAATCTGGGAAAGATTGGTCAGATCTTGAACGCGAGGCCGCCGAAGAGGATAAGAAGGAACGCAATTACGACAGACCGTCGACGGACTTTGATCGGAAACGCAAAGGCGGGAGAGACAGACACCGCTATGACGAGGACCAAGGCAGCAAGAAGAGCAAACACGACAAAAGTTCACACCACAAAAGCTCCAGTTCAAACCACAAAAGCTCGAGCTCAAACCACAAAAGCTCGAGTTCAAACCACAAAAGCTCAAGTTCAAATCATAAAAGTTCAAACCACAAGAGCCCGTCAAAGCACAGCAGTGACAGCCCTTCGAAGAGCAATAAGCACAAGTCCTCCCACGACCGTTCCCGTGACCACAAATCTAATGGCAAGTCGAACGGTGATCACAAGTCGCACAAGAGATCACGTGACGACAGTCGCGAACACGAACGATCCTCTAAGAAACACAAGAAATAA

Protein sequence:

>DPOGS200133-PA
MSNISLDKETFYRRMKRLYAAWKAAAADSKSDDVLAKCDCLVSCVGVDEDTLYSKSTALQTWLFGYELPDTITVLTEQSMCFLASKKKIEFLRQIENGKEETDLPPVKLLIRDRNDHDKENFNKLIQEIKKSKSGKTLGVFAKDNYPGEFCESWKSAMKAEKFENVDISSSVATFMAPKEDSEIITIKKACLVTVDVFTKYLKDQIMEIIDSDKKVKHSKLAEGVEAAISDKKYVTGVDTSQVDMCYPPIIQSGGNYSLKFSAVSDKNHLHFGAIVCSLGARYKSYCSNIVRTLLVNPTDNVQSNYNFLLNLEEEVMKHLVSGAKLSAVYEAGLALAKKEKPELVDNLTKTFGFAMGIEFRESAIVIGPKTAVVAKKGMVFNINIGLANLTNSAATDKEGKTYALFIGDTVLVNDEQPASLLTQSKKKIKNIGIFLKDDDEEEEEEKENKTEILGRGKRTAVIESKLRTEHSSEDKRKEHQRELAIALNEKAKERLAKQSSGKEGEKIRKSTVSYKSVSQMPRENEVKELKLYVDRKYETVILPIFGVPVPFHISTIKNISQSVEGDYTYLRINFFHPGATMGRNEGGNYAQPDATFVKEVTYRSTNTKEPGEISPPSSNLNTGFRLIKEVQKKFKTREAEEREKEDLVKQDTLVLSQNKGNPKLKDLYIRPNIVTKRMSGSLEAHSNGFRFTSVRGDKVDILYNNIKNAFFQPCDGEMIILLHFHLKHAIMFGKKKHVDVQFYTEVGEITTDLGKHQHMHDRDDLAAEQSERELRHKLKIAFKSFCERVENMTKQEVEFDTPYRELGFPGAPFRSTVLLQPTSGALVNLTEWPPFVISLEDVELVHFERVQFHLKNFDMVFVFKDYAKKVAMVNAVPMNMLDHVKEWLNSCDIRYSEGIQSLNWTKVMKTITDDIEGFFDNGGWSFLDPESDAENEEQHDDESEEEDDAYEPTDAESEEESEDDSEYDSEASEMSDDSGDSDGGEEDEESGKDWSDLEREAAEEDKKERNYDRPSTDFDRKRKGGRDRHRYDEDQGSKKSKHDKSSHHKSSSSNHKSSSSNHKSSSSNHKSSSSNHKSSNHKSPSKHSSDSPSKSNKHKSSHDRSRDHKSNGKSNGDHKSHKRSRDDSREHERSSKKHKK-