Monarch geneset OGS2.0

DPOGS208519
TranscriptDPOGS208519-TA747 bp
ProteinDPOGS208519-PA248 aa
Genomic positionDPSCF300064 + 20946-23373
RNAseq coverage461x (Rank: top 27%)
Annotation
HeliconiusHMEL0028101e-9776.44% 
BombyxBGIBMGA008370-TA7e-7760.68% 
DrosophilaUsf-PB3e-1275.00% 
EBI UniRef50UniRef50_D2A3762e-3137.90%Putative uncharacterized protein GLEAN_07937 n=2 Tax=Tribolium castaneum RepID=D2A376_TRICA
NCBI RefSeqXP_974456.23e-3237.90%PREDICTED: similar to USF [Tribolium castaneum]
NCBI nr blastpgi|1892362645e-3137.90%PREDICTED: similar to USF [Tribolium castaneum]
NCBI nr blastxgi|2700058251e-3135.57%hypothetical protein TcasGA2_TC007937 [Tribolium castaneum]
Group
Gene OntologyGO:00056346.4e-20nucleus
GO:00063556.4e-20regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[137-217] IPR0115986.4e-20Helix-loop-helix DNA-binding
[137-188] IPR0010928e-17Helix-loop-helix DNA-binding domain
Orthology groupMCL17829 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208519-TA
ATGGCCAAAGTAGTAGTACTTGACGTCGATGAAAATTCTCTTGATAACAGCGCGGACAACGAATTACTAGGAATGGTAGATGAAGGATCTCTCGGAAGTGCAGATTGTGAGAGATCACAATTAGTGTCGCAATCGTTTCTGGAAAATAACGAAGAAAATAGTACGTCCTACAATTTTAAAGATCTGTCTGGCACAGTGGCGTACAAACTTATACCGATGACAGGGAATCAAACTCCCAACAACATGGCTACCGAGTCTATAACATCTGGTGTACTACCCAGTGGAGAATTCTATGTGATTGGTAATCCTGTACAAGTTTTTGGAACGGAGGCCGGACAAAAAAAAGTTGTTAGAAGGAAGAGTGCTCTGCAGAATAAAGTTATCACCGCTAAGAAGCGTGATGACAAACGACGTGCTACTCACAACGAAGTTGAAAGACGTCGACGTGATAAAATTAACAGTTGGATTACAAAGTTGGCAGCATTGGTGCCAAACTCAGGTCTACAGGATACAGCCAGTAAGGGGGGAATATTGGCGAAGGCGTGTGACCACATCACTGAACTCACAGAAAGACAGAAAAAATTAGAAAAACTAGAAGTGGACAATGACAAACTAGTGTTAGAAGTCTTAAGATTGAATCAAGAGCTTGCCGATTTGAGAAAGGAGAACGCTTCAATGAGAAGTCAGTTGGCCGACAACTGTATAGTTACAATGCAGAACAGACGAGCCCGAGGACAGAAATCTTAG

Protein sequence:

>DPOGS208519-PA
MAKVVVLDVDENSLDNSADNELLGMVDEGSLGSADCERSQLVSQSFLENNEENSTSYNFKDLSGTVAYKLIPMTGNQTPNNMATESITSGVLPSGEFYVIGNPVQVFGTEAGQKKVVRRKSALQNKVITAKKRDDKRRATHNEVERRRRDKINSWITKLAALVPNSGLQDTASKGGILAKACDHITELTERQKKLEKLEVDNDKLVLEVLRLNQELADLRKENASMRSQLADNCIVTMQNRRARGQKS-