Monarch geneset OGS2.0

DPOGS205052
TranscriptDPOGS205052-TA3924 bp
ProteinDPOGS205052-PA1307 aa
Genomic positionDPSCF300074 - 460253-465060
RNAseq coverage80x (Rank: top 64%)
Annotation
HeliconiusHMEL0156020.062.96% 
BombyxBGIBMGA006927-TA0.050.15% 
Drosophila% 
EBI UniRef50%
NCBI RefSeq%
NCBI nr blastp%
NCBI nr blastxgi|2420053556e-1521.11%hypothetical protein Phum_PHUM065820 [Pediculus humanus corporis]
Group
Gene OntologyGO:00056346.4e-06nucleus
GO:00063556.4e-06regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[6-94] IPR0115986.4e-06Helix-loop-helix DNA-binding
Orthology groupMCL25290 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205052-TA
ATGGATCCACCTTCTAGTAATAAAAAACGTACTCCAAGCCGTTGTAGAGAATGGGAAAAGCAAAGACGTATAAAATTTAATGATGCTATATCAAAACTTGGAGATATCGTCAAATCTATACACAAAGCAAACAGTCACATGAATGGTGAAGAGCTTGACAATGCCAACTATCCAAAAATTGAAATCGTTCAAAAAGCAATTATATGCTTGACAAATATAATGCATGAAAAAACACAATTAAAGGCTGAAATTCTAGCTTTAGAAGTAAAATTAGAAGCCATTGAGAAACAAAAGCAAAATAAAAAAGATGTATCACTTCAAGTCACAATTGGTTTAAACAAAAAAAGTCAAAATAACAAATATGTAAAATTATTAATGCTTCAAAAATCAAAAGAAAACAATGGAAAGGAAAAGGAAAAAACAATAACCCAAAAAACAAAACCTAAGATAGATACAAATACATTAAACAAAAATCCACCAAAGTTACCAAAATTACTGCCACTAACAAATATGAAAAAAGAAAATACTATTGTTATGTTGCCAGCTACACCATATATATTTCCCCAACGGCCAGTATTATTTCCTCAACCACCAACTATAGTTTTGGTAGATACAAATCTTCAAACACTTAACAAAACAACAACAATACCAGTTATCAATAGAAATACTAATGACATCACTAGAACAACAATGGTCAATGTGTTACCAATATCAGCATATTCACGTCCATTGTCTGCCCTGAAATCAAAAAAGGGGAGTAAGGCGAAAAATAACTCGCCTAAAAAAGGGGGTAAAAAAACAAAATCCAATGATACTCCTGCAGAAAATAATAAAAATGAAGAAAAGCCTGATGATAAACTCAATCCAGGTAACAATGAAAAAGCCGTTGGTGACCTGAAAGCCTCAGATACAGAAACTACTGCCGAAATAAATGAGCAATCAAATTCAAATAAAGAACTAAGCCCGAAAGAAGCACCAAACTTAAACGATACCAGCATTGTTGCCTGTGAGACAAACAATAATCTGCCTTCTGAAGAAACTGTCGATTTAAATAAAATTACAGAAACGCCCAAAATAGTCGAACCATTAATGGATAAGCCAAATAATATTAAAATAGCCCCAACAAATCTTGCATGTCATCCAGAAAATGTACCAAAAACAGTGTCAGTGGATAATGTCAAATGTGACAAAATTCCTGTTGAGAAAACTTTAGGCGCTGAATACAAAAACAAAGATAATAAGTTACCACCTATAATTGACCCAACTATTTGTGAGAATGTCGTCGATGGTGGTAATGCAAGATTGGAATTAGCAGAAGAATTCTTAGCCGCTTCACCAACAGCTGCTTTTTTGATGTCATTCCCATTAGTGAGTGGCAACAGAGCGGACAGCCCTGCTGAAGAAGCTAATAATACAAATGCAAAGGACAATCGACGCGTTGAAATAGCGCAGCCGGTGTCATACTTTGATAAATCTAACACATCCGATAGTAAAACGAAAGCTTCCATCAAACAGAATGTCACCACAGTTCAAAATACCAATAAGGTAGCCGAACAACAAAAATATGAAAGCAATCTAAAAAACACAGAGGTAAAAGTCACTGCGCCTATATCAAGTGTGACAACCGCTAATGATAATCCATTTCTTAACTTACCTCTGCCATCAATAATATCATCCAGTTGCAATCTAGCCGATGCTACATTTGGTATTGATTTTGATTGCCACGTCACCAAAGCTGGTACCACAATAACAACATCACACAGTAATTCTAACAATTTTATGTACAAAAGTGAACCATTCAATGCAGTTAAAAGCACTATTTATAGTACCAGCAGTATATCGTCTGGACATGAATTTAATAGCTTGGGATTGTATCCGTGCGCTATGGATAATTATTCAAACAAAAATAAGCCTGACCTGACCAACGTTGAGGACAATTTAATGAAGATAAATTCATCGAGGCTGACATATGACATTGATTTAGGATGGTCTCACAAAAGTTTCGATTTCGTCAATTGTACAACCAGCGCGAATACGTTTCACAAAGATACTATATTAACTACTGTGTCCACGCCATTCTCTACAAACTATAATCCGTTCAATCCAGACTTCCACGTCCCGTTAGTTCCTAATTCTAATAAGAAAAACCTTATCAGTAAGACAACTACATTCCCCGATCAGATCACAAACTTTTATTCCCAAGGTGGTAACTTATGGTCTGACGAAGTATCATCTATATACACAAATAGTAATGTTTCGAAGAATTTTATATCGAAGCAACAAAATTATTTTCCCGTGGAACATTTACAACCGAATGTCCATACGAAAACGAGCACAGCCAAACAGTTTGATACGAAACATATATCAGAAAGCGCTACTGAAACTAACTTAAAACCGGCGACTGCTGTGGGACAAGTGGTTGAAAAATATACCAAAAAGTCTCCAAGCAAAATGCATATTAATTGGATGACGTCAGAGATAAGACCAATGCAAAATAATTGCAATCAAACAACAGCGGAAATGAAAGAAACCAAATTACCATATTCCCACGTGGAGCAACTTCCAAAAAAACAAATTCCACAAAATGAAAGCAATTATTTTCCTATCAATATGCACCATTTCCCAACTCAAGCTAATCACGAAGATGTTCAAGTATGGCCGACAGCTCGACCCGCTGGCACTACAGAAATAAGTATCGATCCGCCGCCGATAAATTTGCCAACTTTAATTGGAGATCTAGCACTTGGTCCACATGACAAGAAGAAGGCTGATATTCTAAACAGATCCGTTCCTCATTCCGATTTACAAAACTGTGGAAATTTCCTATCTGTCACCCAGTTAATGAATCGCACCACAGAAAATATGCCACAGCGATCTAATGTGCTAATGGCTGATCAGAAATCCCTAGCGGCGAAACAAAATTTACCTCATATCGTTAATGATAATAGAAAAACGATGACTACGTCTCAGACGAATGTGGGTTACGGTTTTAACGACTCGAAAGCACTTAACTCTTATGAAAATATAAATCAATTTCTACAAAATAAATCAAAAACTTCCCTGAAACCCGAAAAAAATGCAAAAGCGCATAAAAATAATTATTCCGCCGAAGCACTGATACGTGGTGGAACAAATTACAATCAAAAACTACAAGACCATTCCAGTAACAAATTTATGATGCCCGCTCAGAAATATAACGATTTTAACATCCAAGATTCGGGAGTCGCCCAAGTGTCTCATTTTCCATCTATCATTGACTATTCCGACAACAGCTACACTGGACAGCAATTCACAGGGACAGCATTATACAATTCAACTACAAACACGATATCAAATTCTTTTTACTCCAATTTCATGCCGGGAAGCAGTAATTTGATGTCGGGAAATTACACGGCAGCGCCTTTCAGCAGCGAGTTTGTTGATTACAACCAGACGATGGAATGTAACTATACGAACCACAAATATAACGAGGTCAAAATGAGAAACAACACAACCGCGTTCCAACAGGATAAAGAAACAACTAATTACAAGAGTTCAAGAAGAGAATCTGCAGCCAAACATAAATTGGAATGTTCGAAGAAAGATTCCAATAAAAAATATCAAAGTAAAAGACCAAAATTAACTAACGAAGTCGAAGAATGGAACGATAGTTCCCATTTGCTCTGGCAGAATAAAACGCCATCCAAACGGCATCAAAACTTAATGTCAGATGAAATTCCATTTCCGAACTATGTGGGAAATCAAATGCCAACTCAGTATCAGCCAGATTTCTTTAATAGCCATATAATGCCATCCAACATGCAGGGCGTGGCTAATGCTGATCGTTCCTTGGCAAGCTTCCCAGTAGCATCTCGAGCTAACTTTAACCTAAGCGCTTTATTTCCGGAGATAACTATGGTAAAAAGTGCAATGACATTGAAATGA

Protein sequence:

>DPOGS205052-PA
MDPPSSNKKRTPSRCREWEKQRRIKFNDAISKLGDIVKSIHKANSHMNGEELDNANYPKIEIVQKAIICLTNIMHEKTQLKAEILALEVKLEAIEKQKQNKKDVSLQVTIGLNKKSQNNKYVKLLMLQKSKENNGKEKEKTITQKTKPKIDTNTLNKNPPKLPKLLPLTNMKKENTIVMLPATPYIFPQRPVLFPQPPTIVLVDTNLQTLNKTTTIPVINRNTNDITRTTMVNVLPISAYSRPLSALKSKKGSKAKNNSPKKGGKKTKSNDTPAENNKNEEKPDDKLNPGNNEKAVGDLKASDTETTAEINEQSNSNKELSPKEAPNLNDTSIVACETNNNLPSEETVDLNKITETPKIVEPLMDKPNNIKIAPTNLACHPENVPKTVSVDNVKCDKIPVEKTLGAEYKNKDNKLPPIIDPTICENVVDGGNARLELAEEFLAASPTAAFLMSFPLVSGNRADSPAEEANNTNAKDNRRVEIAQPVSYFDKSNTSDSKTKASIKQNVTTVQNTNKVAEQQKYESNLKNTEVKVTAPISSVTTANDNPFLNLPLPSIISSSCNLADATFGIDFDCHVTKAGTTITTSHSNSNNFMYKSEPFNAVKSTIYSTSSISSGHEFNSLGLYPCAMDNYSNKNKPDLTNVEDNLMKINSSRLTYDIDLGWSHKSFDFVNCTTSANTFHKDTILTTVSTPFSTNYNPFNPDFHVPLVPNSNKKNLISKTTTFPDQITNFYSQGGNLWSDEVSSIYTNSNVSKNFISKQQNYFPVEHLQPNVHTKTSTAKQFDTKHISESATETNLKPATAVGQVVEKYTKKSPSKMHINWMTSEIRPMQNNCNQTTAEMKETKLPYSHVEQLPKKQIPQNESNYFPINMHHFPTQANHEDVQVWPTARPAGTTEISIDPPPINLPTLIGDLALGPHDKKKADILNRSVPHSDLQNCGNFLSVTQLMNRTTENMPQRSNVLMADQKSLAAKQNLPHIVNDNRKTMTTSQTNVGYGFNDSKALNSYENINQFLQNKSKTSLKPEKNAKAHKNNYSAEALIRGGTNYNQKLQDHSSNKFMMPAQKYNDFNIQDSGVAQVSHFPSIIDYSDNSYTGQQFTGTALYNSTTNTISNSFYSNFMPGSSNLMSGNYTAAPFSSEFVDYNQTMECNYTNHKYNEVKMRNNTTAFQQDKETTNYKSSRRESAAKHKLECSKKDSNKKYQSKRPKLTNEVEEWNDSSHLLWQNKTPSKRHQNLMSDEIPFPNYVGNQMPTQYQPDFFNSHIMPSNMQGVANADRSLASFPVASRANFNLSALFPEITMVKSAMTLK-