Monarch geneset OGS2.0

DPOGS215981
TranscriptDPOGS215981-TA3621 bp
ProteinDPOGS215981-PA1206 aa
Genomic positionDPSCF300078 - 432365-442025
RNAseq coverage836x (Rank: top 15%)
Annotation
HeliconiusHMEL0086800.067.21% 
BombyxBGIBMGA001211-TA0.062.83% 
DrosophilaHLH106-PB1e-6632.12% 
EBI UniRef50UniRef50_D2A1139e-9832.92%Putative uncharacterized protein GLEAN_07163 n=1 Tax=Tribolium castaneum RepID=D2A113_TRICA
NCBI RefSeqXP_974195.12e-9832.92%PREDICTED: similar to sterol regulatory element-binding protein 1 [Tribolium castaneum]
NCBI nr blastpgi|910814733e-9732.92%PREDICTED: similar to sterol regulatory element-binding protein 1 [Tribolium castaneum]
NCBI nr blastxgi|910814734e-9332.54%PREDICTED: similar to sterol regulatory element-binding protein 1 [Tribolium castaneum]
Group
Gene OntologyGO:00056341.6e-21nucleus
GO:00063551.6e-21regulation of transcription, DNA-dependent
KEGG pathwaydre:7932747e-76 
 K07197 (SREBP1, SREBF1)maps-> Insulin signaling pathway
InterPro domain[286-356] IPR0115981.6e-21Helix-loop-helix DNA-binding
[288-338] IPR0010921.8e-16Helix-loop-helix DNA-binding domain
Orthology groupMCL10763 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215981-TA
ATGGATCCGATGGAGCCTTTAATAAACAATGATGTTTTTAATGTCAACGAAATAGCTGAAATAGAAGATTTCTTGAATGGCTGTGACGGAGATTTTATGAAAAAGTTAGAAGAAGAACTAGTATTTGCAGATAATGACACTGGTCTGTTGAGCGTGGACACTAAATTTAGTACGAATGTTTCATCACCCCAAGATTCTCCTTATTACACTGCTCCGGGGAACCCATTGGTTTCTCAACATCCGCAAAAGAGAAAACTGCCTCCATTTACTCAACAACCGAAAACAAGCCCAGTTTTAGGGGTGTATGGCAGAGAAGAAAGTTATAATTTGGAAGTTAAAGCAGAAAGCCATCAGTTGCCTGAGGTGCTACTTCAAAAAGCTCAAAACCAGACAGTTCAAACTCCGATGTTTGTGCAACAAGTTATTCCAAAACCTCAATATGTTGCCTTAGGAAGCTTACAAAAATTACCCGATGGGGTGGCATCTCTTGTTCAATTAGATTCTCCAATAAATCAAAATACTAAAGCCCAACCCGCAGCCAAGCCATTGCTGCTCCCAAATAATGCAAAAGGGGTTACACCAGTTATTTTGAAGAGCAGCGACTCAAATTTTTCACCTGTGATATTACAGTCAAATATTCTCAATCCTGAAACTCAGACTTTGATGTACACCAGTGCTCCTGTACAAGGAACAACTCAAAGTATTATTACCAATTCTGGTGCTGAAAGTCAGCCTGTACACACTTTCTTTGCTAGTAATAATGGCCCTACATTGGTTACTGGCATACCATTGGTCCTGGATGGTGACAAAATTTCCTTCAACCAATCTTCGAATGGAAGTCCTCCTAAAGTTAAAGAAGTAAAGAGAAGTGCCCACAATGCTATCGAAAGAAGATATAGAACTAGCATAAACGATAGAATAGTGGAGCTTAAGAATATGTTAGTAGGTGAAGAAGCAAAGTTAAATAAGTCAGCAATATTAAGAAAAACAATAGATTACATAAAGTACCTGCAAAATCAGAATACTAGACTTAAACAAGAGAATATAGCTCTGAAACTGGTGTGTCAGAAGTCTGGAGTGAAGGATGTTGTGTTTGATGGAGCCTACACCCCACCACATAGTGACATATCATCCCCCTACCACTCTCCCCATGGTATGGATAGCACTCCTTCATCACCAGAGAGTAAGGTCGAAGAAAAATACTCGAAAATTGTTATTGGAATGGGAGATCATTCTCGTCTAGCATTATGTGCCTTTATGATAGGGCTGATTGCCTTCAATCCATTCAGTGCTTTCTTTGGCAGTTTTATGTCCGAATCCTCTTATGATTACAACGCTCGACTTGATCAACGAAGAATACTTTCCGAAGATAGTTTTGGTGCTGGAAATGTATCGTGGGGAGCCTGGCTATTCAATATGTTTTTGATATATTTGGTAAATACAATAATTTTGGGAGGTTGTCTCATCAAACTTCTAGTGTATGGGGATTCTGTACCAAAATCACAATCTAAGGAAGCCGGCCTATTTTACAAACACAAGCAACAAGCTAATAATCATTTAAAGAAGAATGATCTGGAGAATGCTCGGAGTGAGCTGAATCGTGCTCTGTCGGTATGTGGTCGTAGTGTCCAGGCGGGTGGGTGGGGGCGTTACTCAGCGCTCACTGCTGCTGTAATGAGACAGATACTGCAACGACTTCCCTTGGGAGGCTTCCTGGCAAGACGAGCTGGAGATCTGTGGGGTGACAGTCCAGCGAGACGCGCCACTCAGCACTGGGCCAAAGAAGTGTCTATGGTGTCTCACAAATTGGCCCAATTGGAAATATTATCTAATCAGACGAGTGGCAGTAAATGTGTACTACTGGCGTTACAAGCTGTCAACCTTGCTGAAGTCACGAGCGATAAGCAGTTACTCGCCGAGACTTATGTTACCGCGGCGTTAGTCTTTAAGGACTATATGCCAAATTTTGGAAAATGGCTATGTGGATACTACCTGCGTCTATGTACATATTGGTGTTGGGAGACGATCCCTGAGGGTAATCCGCGTGTACGGTGGGCGACCAGCTCACGGGGACAGGACTTCCTCAGAACCCGTCGCTGGGTGTATGAACAGAAACCTGCTTACCAACTGTTTTCCAGACTACCCACGCTCACCGATCCACTGGCTTATGCTATGAGGGCCTACCACCTGGAACTGCTGCAAACGAGCCTGCAAATGCTGCTTTGTGCTGACGAACGCAGCAGCACACGAGATGTCCTCGACCTGTGGTGGGCTAGCGTCGTTGGCGCTGCGGCAGCCTGCTTGCTGGCCGACGCGCCCGCCATCGCCGACCTCGCCGACAAACTGGCCGTTCTGCCGGACGAACTCGCCACCAGTGAGGATCCGCTGCCGGGTTCGCTGGACATGGCCTACAAAAGCCGGCGCGGGCTGCTCGCACTAGCTCACTGCTCAGATGAAGACAAGCACTCGAAGACCACTCACACGCTGCTCAAGACGATCCCTGAGGGTAATCCGCGTGTACGGTGGGCGACCAGCTCACGGGGACAGGACTTCCTCAGAACCCGTCGCTGGGTGTATGAACAGAAACCTGCTTACCAACTGTTTTCCAGACTACCCACGCTCACCGATCCACTGGCTTATGCTATGAGGGCCTACCACCTGGAACTGCTGCAAACGAGCCTGCAAATGCTGCTTTGTGCTGACGAACGCAGCAGCACACGAGATGTCCTCGACCTGGTGAAGCTGATTATTGATGACGTGTCCACAGACGCGCCCCATCACTCAGGTTGCTGGGACCCGGTGTTAGAGTGGTGGGCTAGCGTCGTTGGCGCTGCGGCAGCCTGCTTGCTGGCCGACGCGCCGGCCATCGCCGACCTCGCCGACAAACTGGCCGTTCTGCCGGACGAACTCGCCACCAGTGAGGATCCGCTGCCGGGTTCGCTGGACATGGCCTACAAAAGCCGGCGCGGGCTGCTCGCACTAGCTCACTGCTCAGATGAAGACAAGCACTCGAAGACCACTCACACGCTGCTCAAGGTTTGTGATGTCGCCGGAGCCCGGCTAGCGGATTCCTTGGCGTATTACTGCTGCCGGAAGCCGACACAGCTCATGATGCTGATGCAGGTCCTATGCTGTGATTGGGTGCTGGAGGTGAGAGCGGGGGTGTGGGAGGCGCGCGGCGCGGGAGGGGGCGGGTCGCCCGTCCACAACCAGCTGGCTGGCTTCCAGAGGGATTTACATTCTTTGAGGAGGCTGTCGCAGAACTTACCGTGGGTGACGTCAGCCCACAAGGACGTGAGGCGGCACTGCCGCATGATGGCGGGCGCGGCGCCGCGGCGCACGCAACAACTGCTGGACGGGAGCCTCAGACCCAGGTCTAACAGGACCTCGCTGATATGCGGCAAGGAGCGTGCGTTAGAGGGCGGGGGTGGGGAGGGCGAACGTGCGGTAGCTTTATACATGGCGTGCAAGCATCTCCCGGCGGCGGTGCTAGCGACCCCCGGCGAGAGGGCCGGCATGTTGGCGCAAGCTGCAGCTACGCTACAGAAGATAGGCCATCGTTCAAGACTACCACACTGCTACCACCTCATGAAGACCTTTGGCACTCTGCCCGCGCCTTGA

Protein sequence:

>DPOGS215981-PA
MDPMEPLINNDVFNVNEIAEIEDFLNGCDGDFMKKLEEELVFADNDTGLLSVDTKFSTNVSSPQDSPYYTAPGNPLVSQHPQKRKLPPFTQQPKTSPVLGVYGREESYNLEVKAESHQLPEVLLQKAQNQTVQTPMFVQQVIPKPQYVALGSLQKLPDGVASLVQLDSPINQNTKAQPAAKPLLLPNNAKGVTPVILKSSDSNFSPVILQSNILNPETQTLMYTSAPVQGTTQSIITNSGAESQPVHTFFASNNGPTLVTGIPLVLDGDKISFNQSSNGSPPKVKEVKRSAHNAIERRYRTSINDRIVELKNMLVGEEAKLNKSAILRKTIDYIKYLQNQNTRLKQENIALKLVCQKSGVKDVVFDGAYTPPHSDISSPYHSPHGMDSTPSSPESKVEEKYSKIVIGMGDHSRLALCAFMIGLIAFNPFSAFFGSFMSESSYDYNARLDQRRILSEDSFGAGNVSWGAWLFNMFLIYLVNTIILGGCLIKLLVYGDSVPKSQSKEAGLFYKHKQQANNHLKKNDLENARSELNRALSVCGRSVQAGGWGRYSALTAAVMRQILQRLPLGGFLARRAGDLWGDSPARRATQHWAKEVSMVSHKLAQLEILSNQTSGSKCVLLALQAVNLAEVTSDKQLLAETYVTAALVFKDYMPNFGKWLCGYYLRLCTYWCWETIPEGNPRVRWATSSRGQDFLRTRRWVYEQKPAYQLFSRLPTLTDPLAYAMRAYHLELLQTSLQMLLCADERSSTRDVLDLWWASVVGAAAACLLADAPAIADLADKLAVLPDELATSEDPLPGSLDMAYKSRRGLLALAHCSDEDKHSKTTHTLLKTIPEGNPRVRWATSSRGQDFLRTRRWVYEQKPAYQLFSRLPTLTDPLAYAMRAYHLELLQTSLQMLLCADERSSTRDVLDLVKLIIDDVSTDAPHHSGCWDPVLEWWASVVGAAAACLLADAPAIADLADKLAVLPDELATSEDPLPGSLDMAYKSRRGLLALAHCSDEDKHSKTTHTLLKVCDVAGARLADSLAYYCCRKPTQLMMLMQVLCCDWVLEVRAGVWEARGAGGGGSPVHNQLAGFQRDLHSLRRLSQNLPWVTSAHKDVRRHCRMMAGAAPRRTQQLLDGSLRPRSNRTSLICGKERALEGGGGEGERAVALYMACKHLPAAVLATPGERAGMLAQAAATLQKIGHRSRLPHCYHLMKTFGTLPAP-