Monarch geneset OGS2.0

DPOGS214553
TranscriptDPOGS214553-TA1605 bp
ProteinDPOGS214553-PA534 aa
Genomic positionDPSCF300266 - 82679-89014
RNAseq coverage1045x (Rank: top 12%)
Annotation
HeliconiusHMEL0160613e-13573.35% 
BombyxBGIBMGA003285-TA9e-8663.97% 
Drosophilada-PA4e-5384.55% 
EBI UniRef50UniRef50_E0VQ955e-7841.48%Protein daughterless, putative n=1 Tax=Pediculus humanus corporis RepID=E0VQ95_PEDHC
NCBI RefSeqXP_973272.23e-7955.33%PREDICTED: similar to AGAP008814-PA [Tribolium castaneum]
NCBI nr blastpgi|1892392125e-7855.33%PREDICTED: similar to AGAP008814-PA [Tribolium castaneum]
NCBI nr blastxgi|3287860994e-8441.59%PREDICTED: hypothetical protein LOC410553 [Apis mellifera]
Group
Gene OntologyGO:00056341.4e-22nucleus
GO:00063551.4e-22regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[428-487] IPR0115981.4e-22Helix-loop-helix DNA-binding
[433-486] IPR0010921.4e-11Helix-loop-helix DNA-binding domain
Orthology groupMCL16048 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214553-TA
ATGGCTCTACTTCCGGTGTACGGCAGCGATGAATTCGGGCACGATTCACCCTCCAGATACGCGTCTCCGAAGGGCGCCGGCGCCAGTGCTGGCTATCAGGAGCCCTACTACGGGGAGTGGGCGGCGGGGTACTACCAGCCGCCGCCGCCCTATCATCATGATCCGTACGCGACGTCGCCGGGCATATCCGGTGCGCCCACGGACGCGAGCGCGGGCGGGGCGGAGCTGCCTCTGCCGCCCATGTCGTCGTTCCGCGCCGCCGCCCCCGTACACTCGCCCAGCGACCCCATGATAGTCGCCAAGCCGCCCATGCAACCGATGTACGCGGGGTCGGCAAACACCCCATCGGGCGGCGGCGAGGGCGGAGGACCAGCGGGCGGGCCGGGCGCCGGAAGCCTGTCCTCGTACTCGTCTCCCTCCACGCCGGTGCACTCGCCCCCGCCGCTACACGCCAGGCTCTACCCCATGAAACACTCCCCGCACCACCACCACCACCATAACGGACAGCAAGCGAGCTGGGTATCCACGGGTGTGTCGTCCCCACCCACGGCGGCGACGCCCCACGCGCCTCTGACGGGAGCTGTACTCCCGAATGGTCACCAGCACGTCGTGTTCCCACCCGTTATGGGCGCGCCGGCTGAACAACGCCAGCTAGACGAGGCGATGGTCTTCCTGAGGGAGCACTCCGACGTCGGGGGCGCTCGTATGGAGGAGCGTCTCGACGACGCGATCAACGTCCTGAGGAACCACGCGGAGGCGCCCGACCTGTACCCTCAGGACCACCACGTGCCACCGCCCGGTGCGGTGAGTCGCGTGGGCGCTCTGTCACACCTCCACGAGCCGCCCGTCAAGATGGAGAGGCATCTCATGGCAAATACTAAGAAACGCAAAGAGCCCCCGGACTCCGGGCTGGACTCGAAGCCTTCCTCGTCAGGCTCTGACGCGCTCACCAAGCCTCCGGGGGGGAAGAGGTCCAGGAGATATGTGAACAGCTGCTCGTCCGCTGATGAAGACGAGCTCGACCCGGACGCCAAGGCGGCGCGCGAGAGAGAGAGGAGGCAGGCCAACAACGTGCGGGAGCGTGCGAGTGAGAGGAGATGCTCTCTCACCTGCGTCGTGCATGTCTGTATGTATCGTGCATGTGTATCACGAGCGCAGTCGCCGGTTTCCTACCATGACCATACAGGTGTCCCCTCTCTCGCGCGCTGCCTGCTGGACGGTTGTTCGTCAGCTGACGAGGACGACATGGACCCGGAGGCGAAGGCGGTCCGCGAGAAGGAGAGGCGGCAGGCCAACAACGCCAGGGAGCGGATACGTATCAGAGACATCAACGAGGCGCTGAAGGAGCTGGGCAGGATGTGTATGACGCACCTGAAGAGTGACAAGCCGCAGACCAAGCTCGGGATCCTCAACATGGCTGTGGAGGTCATTATGACGCTCGAACAGCAAGTCAGAGAACGCAACCTGAACCCTAAGGCGGCGTGTCTGAAGAGGAGAGAGGAGGAGAAGGCGGAGGACGCGCCCAAACTGTTGGCGGCGCCCATACACCATTACCAGCCCGTCACGGGCATGGGAGGCGCCCCACCCCCCGCGCCGCCGCAATAG

Protein sequence:

>DPOGS214553-PA
MALLPVYGSDEFGHDSPSRYASPKGAGASAGYQEPYYGEWAAGYYQPPPPYHHDPYATSPGISGAPTDASAGGAELPLPPMSSFRAAAPVHSPSDPMIVAKPPMQPMYAGSANTPSGGGEGGGPAGGPGAGSLSSYSSPSTPVHSPPPLHARLYPMKHSPHHHHHHNGQQASWVSTGVSSPPTAATPHAPLTGAVLPNGHQHVVFPPVMGAPAEQRQLDEAMVFLREHSDVGGARMEERLDDAINVLRNHAEAPDLYPQDHHVPPPGAVSRVGALSHLHEPPVKMERHLMANTKKRKEPPDSGLDSKPSSSGSDALTKPPGGKRSRRYVNSCSSADEDELDPDAKAARERERRQANNVRERASERRCSLTCVVHVCMYRACVSRAQSPVSYHDHTGVPSLARCLLDGCSSADEDDMDPEAKAVREKERRQANNARERIRIRDINEALKELGRMCMTHLKSDKPQTKLGILNMAVEVIMTLEQQVRERNLNPKAACLKRREEEKAEDAPKLLAAPIHHYQPVTGMGGAPPPAPPQ-