Monarch geneset OGS2.0

DPOGS214217
TranscriptDPOGS214217-TA2937 bp
ProteinDPOGS214217-PA978 aa
Genomic positionDPSCF300014 + 756281-764043
RNAseq coverage410x (Rank: top 30%)
Annotation
HeliconiusHMEL0022490.069.28% 
BombyxBGIBMGA005944-TA2e-7571.08% 
DrosophilaPNUTS-PD7e-6137.14% 
EBI UniRef50UniRef50_D6WL763e-12941.07%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WL76_TRICA
NCBI RefSeqXP_973497.15e-12941.01%PREDICTED: similar to GA17975-PA [Tribolium castaneum]
NCBI nr blastpgi|2700070461e-12841.07%hypothetical protein TcasGA2_TC013494 [Tribolium castaneum]
NCBI nr blastxgi|3504105910.039.70%PREDICTED: hypothetical protein LOC100741303 [Bombus impatiens]
Group
Gene OntologyGO:00056342.4e-12nucleus
GO:00036772.4e-12DNA binding
GO:00063512.4e-12transcription, DNA-dependent
GO:00082707e-05zinc ion binding
GO:00036767e-05nucleic acid binding
KEGG pathway 
InterPro domain[57-146] IPR0179232.4e-12Transcription factor IIS, N-terminal
Orthology groupMCL15021 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214217-TA
ATGCCCCGTATAGATCCTATCCAACTATTAAATTGCCTCAGTGTTCTTCTCTCACCTGATGGAGGCATTAAAAGTAGGGATGAAGTGCCAAGATTAGTAAGGCTAATGACCAAGTTTTCAAAGAAGCTAGTTTCTAAATGCATTTATATCCAAATTCTTAAGTGTACAGAAATTGAGTTATTAGGTTTATTTATGGGATCCAAAGGTTGGAGACTAGTACACATGTGGCTAACAGAGAGCATTGTTGCCAAAAATTGGCCTTTAGTTAGAGAGTTATTGGAATTACTGTTATTGTGTCCAGTAGATATAGAAAGGCTAAAGACAAATAACTGTCCTAAACTAGTTAAGGAGCTTTCAAAAGATGGAAATCACTCTGCTATACGAGCATTGGCAACAAAATTAGTTGAACAATGGCTCAAAACTGTGAAAGGTGGAGAACAAGTAGTGCCTATGCATATTTCTGAAATTCAACAAATAATAACAGACTCACAATTGGAAACTAAATTAGAAGATTTAAAAAATGATGCATCGGACAATGAAAAAGATTTAGATGATTGTGACTCTTCTAATTTGGAAAATAATAATAAAGTAGAAGTAGATGAGGAAACTCTAGAGACTCCTTGTAAAATAGACATCACTGATAAGGATGAGCAAAGTATGGAAACTGAAGTTTTACCAGTTTTAAAAATAACTTTAAAGAATGGAAAACAAATCCTATCACAAGTGGAAGATGACAGCAAATCTTCAGAAGATTCAGTTGACAAGGACAACAATAAAGAAAAACAGAAAAGTAGAAGTAAAGACAAAAGCGAAAAAAGCAGTAGTAGTAGTTCATCAAAATCTAGCTCCTCAAAACATTCAAGTAGGTCCTCATCTAGTGATAAAAGGAGTAATCATAAGGAAAGTTCTTGTAAACATAGTAGTAAGGATAAATCAAGGGATAAAGATAAAAGAACTTCCCATAGTTCTAGGTCCAAATCTAGTAGTAGCAGTTCTAGTAGTAGTAGACGTTCCAGTTCTTCCAGTAGCAAATTAAAGGATGAAAAAAGTAGCAAAGATAAAAGTAAAGATAAGGATAGCAAACAGAGTTCAGAAAAATCTGATGATAAGGTGCAAACTGCTATTAGCAAATTAGGAAAAATACCAAAATTGAGTGATGTTAAGAAAGAAAAGCTTTCTATTTCTATAGAGGTACGAAAACCAGATGAACCAAAACCAAAAACTGTGAAAACATTTAATTCTAAGTTTAGGAAACATGGATTAGAAGAAGAAATGAAACCACCTCCATCAAGGGCTACTTTACTAAGCAACAAGAAACCACCACCCGCACTACCACCAACAGTCTCAATACCTAAAAGACCATCACCTGTTCATAATGACACTCCTCCAGAGAAGAAACCAAAACCATCTATTGATCTTGAAAAACCAGGATCAATAAAACTTATACCACCTAAACCAAAGCCCATGGTACTCCAGGAAAGCGATATGTTTATGGATGCTTTAAATGCCTCTGCTACTAATAAAAAGGAACCGAAGAAGAGGAAAAGGCGCACCAGTGGTTCCAAGGATGGGAATTCGCAGAATGATGGATCACCTCCTCATACACCAACTGGTGTTACCAGTCCTAATGGAGACAGCAAATCAGTGCCGCCAAGATTTTATCAAGATACTCTTGATGAAGAAGATAATAAGGATAAAGTGACAGACAAAAATGAAAATGAAGGTGAAAATTCACCAAAAGCAGCTATAAAAGATAAAAGTACGGAGGATAGCATGGAGACAGAACCACACACGTTGACTGTAAATGGTATAAAAGGTGTCCTATGCTATCACAAACGAAAAGGGCCGAAAAAAAGTATTCGATGGAAACCAGATTCTGAACTAGAAGAAATACAATACTTTGAACTTGATGAGACTGAAAGAGTAAATGTTACTAAAACCTTTACTGACATGAAACATCTAGAAAGAATAAATGAAAGAGATGCCTTCCAAAAAGCTAGGAATCTCAGTAATGATGACATAATGGAAGAGAGAACTAGTTGGAGACCTTTAATACCACTTGATACAGATGGTCAAATCCAGATAGAATATGGCAAAAATAGCAAAGAAAAAGATATACAAGCCATACGTCAAAAAGGTACCCTGCAACCACTATACTTCCATAAAGCTATGATTCCTGACTCACCCCATGAGCCAGATGTTGAAACTCACACTTATGCTGAACCAACAGTTATTCCACTAGAAGATGTTACAGGAAACCAAGATAATATAAGTGACTTTAGAAACATGCCTTGGCCTGAACCAAAGGGTAATGCACCTCCAACAAGCAATAATAATTTGAATGTTCCATCTATGTTTCCACCTAATATGTCTCAATACCCAAACAATTATCCTAATCCACAATTTCCTGGAGGACCACCCGGGTTCCAAGGTCCCCCTATGATGTCTGGAGAATGGCAAAATGGAGTTCCTCCCATGTTGCCAAATGGTATGCCTGGTCCAATTGGAAACAATGTCATTGGACCAGGTGCTATGCCCCCAGGTTCTATGCCACAAGGTAATGTGCCACCAGGTCTATTGGGACCACCGGATAATATGATGATGGGTCCAGATATGTTTGGTGGGCCAAACCATATGTTTCCTGTTCCTCCAGATGGTTTTAACATGCAGCAGAATATGTTCCCTGTCGATTTCAATATGTCAGGTCCACCAGGTCCAGATTTTCCTGGTAATTTCAGAGGAGCAATACGCGGCCGTGGTTCTGGAGGCCATTGGAGAGGTAAAGCAGCAGGTGGTTGGGATGGACCTCCTAGGGGTCGGGGAGGTGGTCGAGGCGGAGTCCGCAAAGCTGTGTGCATTTATTTTCAAAGAAAAGGATCTTGTCGACAAGGTGAAAATTGTACATTTTTGCATCCCGGAGTTAATTGCCCCTTTTAA

Protein sequence:

>DPOGS214217-PA
MPRIDPIQLLNCLSVLLSPDGGIKSRDEVPRLVRLMTKFSKKLVSKCIYIQILKCTEIELLGLFMGSKGWRLVHMWLTESIVAKNWPLVRELLELLLLCPVDIERLKTNNCPKLVKELSKDGNHSAIRALATKLVEQWLKTVKGGEQVVPMHISEIQQIITDSQLETKLEDLKNDASDNEKDLDDCDSSNLENNNKVEVDEETLETPCKIDITDKDEQSMETEVLPVLKITLKNGKQILSQVEDDSKSSEDSVDKDNNKEKQKSRSKDKSEKSSSSSSSKSSSSKHSSRSSSSDKRSNHKESSCKHSSKDKSRDKDKRTSHSSRSKSSSSSSSSSRRSSSSSSKLKDEKSSKDKSKDKDSKQSSEKSDDKVQTAISKLGKIPKLSDVKKEKLSISIEVRKPDEPKPKTVKTFNSKFRKHGLEEEMKPPPSRATLLSNKKPPPALPPTVSIPKRPSPVHNDTPPEKKPKPSIDLEKPGSIKLIPPKPKPMVLQESDMFMDALNASATNKKEPKKRKRRTSGSKDGNSQNDGSPPHTPTGVTSPNGDSKSVPPRFYQDTLDEEDNKDKVTDKNENEGENSPKAAIKDKSTEDSMETEPHTLTVNGIKGVLCYHKRKGPKKSIRWKPDSELEEIQYFELDETERVNVTKTFTDMKHLERINERDAFQKARNLSNDDIMEERTSWRPLIPLDTDGQIQIEYGKNSKEKDIQAIRQKGTLQPLYFHKAMIPDSPHEPDVETHTYAEPTVIPLEDVTGNQDNISDFRNMPWPEPKGNAPPTSNNNLNVPSMFPPNMSQYPNNYPNPQFPGGPPGFQGPPMMSGEWQNGVPPMLPNGMPGPIGNNVIGPGAMPPGSMPQGNVPPGLLGPPDNMMMGPDMFGGPNHMFPVPPDGFNMQQNMFPVDFNMSGPPGPDFPGNFRGAIRGRGSGGHWRGKAAGGWDGPPRGRGGGRGGVRKAVCIYFQRKGSCRQGENCTFLHPGVNCPF-