Monarch geneset OGS2.0

DPOGS201526
TranscriptDPOGS201526-TA2748 bp
ProteinDPOGS201526-PA915 aa
Genomic positionDPSCF300006 + 1289428-1292175
RNAseq coverage1029x (Rank: top 12%)
Annotation
HeliconiusHMEL0090920.085.81% 
BombyxBGIBMGA002705-TA0.079.70% 
Drosophilabun-PG3e-4654.10% 
EBI UniRef50UniRef50_E2BNB41e-6652.31%Protein bunched, class 2 isoform n=6 Tax=Formicidae RepID=E2BNB4_HARSA
NCBI RefSeqXP_395024.33e-6954.61%PREDICTED: similar to bunched CG5461-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3800158195e-6854.61%PREDICTED: uncharacterized protein LOC100867598 [Apis florea]
NCBI nr blastxgi|2420254722e-9235.67%conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene OntologyGO:00063555.5e-27regulation of transcription, DNA-dependent
GO:00037005.5e-27sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[53-866] IPR0005805.5e-27TSC-22 / Dip / Bun
Orthology groupMCL26500 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201526-TA
ATGGCTGACAATCTGATTCAAAAGTCACATAAAACTAGCGAAAAAAACAAGTATAACAACGTTGTTCATCGTACTACGAGTGAATCTCTTCGACTAAATGAGTCTGAGAAGGGAGTGACTCACCCGACGAGTCTTCAATCCGCCCATAACCCAAGGAAAATATCTTCCTTTCAGATTACGAGTGTGACTGTTGGGTCTCGGGTGAGCACTGATGCAGGGGAGGACTCCGCGGACGATCTAGACGAATCTCACACCGATGACATCTCAAGGGTGACCGACATAGAGAATGAAACACCGAGCTACTCTGAAGACACCTTTTCGAAAGACGACGTTTTCTACAACGCGTCAAGTGCATCGCTAGGTTGTGCGCCCGTCATTCCGACCAGTTCGCAGTACGGACTCGCGATCGTCGGTCAGGACGCTAATACCAATCAAGTAGGAGGAGCTGTGCCAAATAGTAATAACACGGAAGTGAATGACATGCACGTCAGTGTCACTAACGCCGGAACAGGCAGCATCATCAATCTTATAGGTAATTCTAAGCCTCAAGAAGGCATGAAGGAGATCCAAGAACATGTCAGAAATGAGAGGTTTAAAGTTGTTAAAATTGAAAGTACTGAACCTTTCCGCCGTGGCAGATGGATGTGCATGGACTATTTAGATCACACTACTACACAACAAAATGCCCCAATAACTTTGAACAATAATTTGGATGTCACAGAAACCAATGCTTTGCAAGCACCTGATAGTGGAGTTGTTATTAATGATAGTCAACATGATGATATGTGTAATGATTTGGCAAATAAAGTGCCTAATGATCAAGTGAGTGCTCCTATTCAACAAATGGATCAATGTGTGCAGAAACAATTTCCGATGGCATCTCCGGGGCAATCACTTACTCAACCCATTAATATGGCCCAGCAACCTATGCCTGTCACTCAGTCAGTATCTGTACAATCTCCACCATTAGAGATGCCACAGCAAATGATTAATCAGAATATGCAATCAAACCAACAAGTGGCTCAACAACATCCTCAAAGTATGACACATATCACTATGCAGAATGCACCGCAGAGCCATCAACAGCCTACCCAACAGCAGCAGGTTCAACAGATTCCACAAAGTTTTCCACAGCATCAACTTCAACAAGTTATTGCACAGTCTCAAAGTATGGCAATGCAACAAATACCTATGCATCAACAAATGCCGCAGCAAATGCAGCAAATTCCCAATCAACAGATGCAACAAATGGGTCAACATTTGCCTCAAGCTCAGATAACTAATATGCAATTGCAGCAGCAGATTCCACAAATGCAAGGGCTGTCTAATCAGGGGCAACTCTCTCAAATTCCGGGCCAGCAGCCACAAATACAACAAATGCAGATGCAACCTATGCAGGGACAGCCAAATCCACAACAAATTCATCAGATGCAACAAGCCCAGATGCCTAATATGGGTCAAGGACACCATCTTCAAGGTCAACAAGCTATGGGACCTCAGCAAGCCCAACTGCAGCACTTGTCTAGTCAGGCACAAATTCAAGCACTGCAGAATCAACAGATGCCAAACCATATTCAGCAAATGCAATTACCAACTCAACTCGCTGCGACTAATCAACAAATATCACAAATGCAAACCCAAATGCATCAACTTCCATCACAGCCCCAACAATCAGTGGTACCAGGAATGCATCCACAAATGCAGCAGCAGGGAATGGCTGTGCCGCAGCCACAATACAATCAAGCGGTGAATCAGCAAAGTAACGCTCAGTCGATGATTAGTGGCAGTCTTCCATCATCACAGCAACCTATGGTTCAAACACAACACAATGTTCCTCAATACCATACCCAACAATCTCAGATGACACAGCAGGGCCAGACATTGCCACAAGAGGTGCTGACTAGTATTGTAACATCACAGCAAGGTGCAACTCTTCCTACTAATTTGCAACCAATGGCCTCGCAGCCACAAGGATCAACTTTACCAGCTAATCTCCAAAGTCTAACTGGGCAACAGCAGGCTCCCGTTCACACTATGGCCCCACAACAGACTAATGTACCACAAGGCAATATACAGATTATGAGTGCTGCACCACAGAATATGCAGATGACTTTAGATGGCAATCAGGCAGGAATGCAAATGCCAGCCTCTGATCCCATTTACATGCAACCACCCAATGTTGGGCAACAGATGCCAGCACATATGACCCAACCGCAAACCATGTTGCAGCAACAGGTGAGTGGCCAACATACAACACAGATGGGTGGTGTGCAATATGTTCCTAATCAGACAGTGCCTCAAGTGTCTATGGCACATCAGAACATTCCGATGTCTATGCAGCAAAGCATGCCGATGGGAATGGGCGGAGGAGTTCATGCTAGTGTAGTGCAATCTCAGACGAGCATGGGTCTGGGTTATGGGGGTGTGGTACCCGTGATGTCACAGACTAGCGTGGCGCCAGTCGAGGCGACCGTGTCGGGGACTAACTCGCCAATAGTGTCAATGCCAGTAAATTCCACCGCGTACGTGTCTAATGCTCCACAGCCGGGACATGACAGTCAGGGTTTCGGCAGCCCAGTGAGTGCGGTGGTGTCTCACGCCATTAGTGGCTCGGTGGTGAGCAATGTGAATGTGAATGCGTGTGATAGCAGTGCACCAGAATCGGTACCAGACGGAATGCAGGCTGGCGACACTGGCGATGGTAAAGAGGAGCCGCAACCAGCTGTGCAACCAGACGATGAAAGGTAA

Protein sequence:

>DPOGS201526-PA
MADNLIQKSHKTSEKNKYNNVVHRTTSESLRLNESEKGVTHPTSLQSAHNPRKISSFQITSVTVGSRVSTDAGEDSADDLDESHTDDISRVTDIENETPSYSEDTFSKDDVFYNASSASLGCAPVIPTSSQYGLAIVGQDANTNQVGGAVPNSNNTEVNDMHVSVTNAGTGSIINLIGNSKPQEGMKEIQEHVRNERFKVVKIESTEPFRRGRWMCMDYLDHTTTQQNAPITLNNNLDVTETNALQAPDSGVVINDSQHDDMCNDLANKVPNDQVSAPIQQMDQCVQKQFPMASPGQSLTQPINMAQQPMPVTQSVSVQSPPLEMPQQMINQNMQSNQQVAQQHPQSMTHITMQNAPQSHQQPTQQQQVQQIPQSFPQHQLQQVIAQSQSMAMQQIPMHQQMPQQMQQIPNQQMQQMGQHLPQAQITNMQLQQQIPQMQGLSNQGQLSQIPGQQPQIQQMQMQPMQGQPNPQQIHQMQQAQMPNMGQGHHLQGQQAMGPQQAQLQHLSSQAQIQALQNQQMPNHIQQMQLPTQLAATNQQISQMQTQMHQLPSQPQQSVVPGMHPQMQQQGMAVPQPQYNQAVNQQSNAQSMISGSLPSSQQPMVQTQHNVPQYHTQQSQMTQQGQTLPQEVLTSIVTSQQGATLPTNLQPMASQPQGSTLPANLQSLTGQQQAPVHTMAPQQTNVPQGNIQIMSAAPQNMQMTLDGNQAGMQMPASDPIYMQPPNVGQQMPAHMTQPQTMLQQQVSGQHTTQMGGVQYVPNQTVPQVSMAHQNIPMSMQQSMPMGMGGGVHASVVQSQTSMGLGYGGVVPVMSQTSVAPVEATVSGTNSPIVSMPVNSTAYVSNAPQPGHDSQGFGSPVSAVVSHAISGSVVSNVNVNACDSSAPESVPDGMQAGDTGDGKEEPQPAVQPDDER-