Monarch geneset OGS2.0

DPOGS205653
Transcript: DPOGS205653-TA (1185 bp)
Protein: DPOGS205653-PA (394 aa)
Genomic position: DPSCF300023 + 298051-299235
RNAseq coverage: 5x (Rank: top 88%)
Annotation
Heliconius: HMEL006590  7e-155  81.59%
Bombyx: BGIBMGA001002-TA  2e-125  67.25%
Drosophila: ac-PA  6e-18  57.50%
EBI UniRef50: UniRef50_A6N869  1e-122  67.00%  Achaete-scute-like protein ASE n=2 Tax=Obtectomera RepID=A6N869_BOMMO
NCBI RefSeq: NP_001098696.1  3e-123  67.00%  achaete-scute-like protein ASE [Bombyx mori]
NCBI nr blastp: gi|157412310  5e-122  67.00%  achaete-scute-like protein ASE [Bombyx mori]
NCBI nr blastx: gi|157412310  8e-140  67.00%  achaete-scute-like protein ASE [Bombyx mori]
Group
Gene Ontology: GO:0003677  5.3e-30  DNA binding
GO:0005634  3.8e-23  nucleus
GO:0006355  3.8e-23  regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain: [92-185]  IPR015660  5.3e-30  Achaete-scute transcription factor-related
[87-160]  IPR011598  3.8e-23  Helix-loop-helix DNA-binding
[97-161]  IPR001092  2.8e-19  Helix-loop-helix DNA-binding domain
Orthology group: MCL26617  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205653-TA
ATGAGTTCCATAGGTGTTGTTGTGTTCCGTAATTCTCCGCTCAAACAGCAGGTGCTCCAAGAATCTGTGAATAATTCTGTAAATATAAGCAATAACGATAACGTCCGTCGTGAAATCATAATTTTGAGAAAGAAACAAAAACATCAATCCCAAAACACAGTGTCAGTGACGTCGCTAGTGAGTGCTTCAGAACCAAATAATATTGTCTTAGCTAAACGTCCTAAGCTACGTGAAAATATTCCTGATGAATCTACACGTACCCCTACACCACTAGCGGTAGCGAGAAGAAATGCGCGGGAAAGGAATCGTGTACGTCAAGTTAACGACGGTTTTGCTGCTTTACGGCGACATATACCTGACGAGGTCGCTGCAGCTTTCGAAAACGCGAATTCGAATAGAGGCCCTAATAAAAAGCTTAGTAAAGTTGAAACACTGCGTATGGCTGTAGAATATATTAGAAATCTAGAGAATCTTCTTAATATAGGGCATGGAGATAAAGAAAATATGTCACGACCATCAATGGAATCATTTCCATCACCAGCATCTTCATCTCCTAGAGACAACAGCCAGGAAAGAAGTTATTATTCCCTAAATTCACCAGCTCTTGATGATGAAGATATGGAGGAAGATGATTTGGACAGTTCCCTCCATCGACTCCCAACTCAGCAATATATGGACTTACCTTCTGAAACTTTTCAATTAGTTTCCACGCCACACTTATACGAGGAAGATGAATCTAGAAACCCACTTACTCCCTCATCTGACCTTCTCGGAGCGGAGGAAATGCATTCTCATGTTTTGGAAACACACTTTCCATTTCCTAACTCAGCTGAACAATTCACAGTTATTCCTGAACAAAACTATTGTGAACCAGAAGCTGCATTGAATGATGGAGATTTTGAAGTTAAATATGCTGAAACAATACAAAATATTCATCGCAACTTTGAAGAAGATTCACAGCTGCCATTAGAAGCAATCAATCCAGATTTGATGCTGTCACATAATCAATATAAATTCAAGGAAGAAAGTGAATATTTAGATCATCACTACAACGAGGTCGACCTTAAAAAGGAACTACCAGACATACAAGTTACTCCAGAAGATCGTGAACAATTCGAAGAGACTCTTAAATGGTGGCAAGAGAAAACTAGACAGGCGCGACCTATTCCTAAAAGCAGCACTTGA

Protein sequence:

>DPOGS205653-PA
MSSIGVVVFRNSPLKQQVLQESVNNSVNISNNDNVRREIIILRKKQKHQSQNTVSVTSLVSASEPNNIVLAKRPKLRENIPDESTRTPTPLAVARRNARERNRVRQVNDGFAALRRHIPDEVAAAFENANSNRGPNKKLSKVETLRMAVEYIRNLENLLNIGHGDKENMSRPSMESFPSPASSSPRDNSQERSYYSLNSPALDDEDMEEDDLDSSLHRLPTQQYMDLPSETFQLVSTPHLYEEDESRNPLTPSSDLLGAEEMHSHVLETHFPFPNSAEQFTVIPEQNYCEPEAALNDGDFEVKYAETIQNIHRNFEEDSQLPLEAINPDLMLSHNQYKFKEESEYLDHHYNEVDLKKELPDIQVTPEDREQFEETLKWWQEKTRQARPIPKSST-
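
The lengths reported in this record can be cross-checked against the sequences themselves. The following is a minimal sketch, not part of the OGS2.0 resource: it assumes the standard genetic code applies, that the trailing `-` in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive. The function names (`translate`, `coding_length_aa`, `span_length`) are illustrative only.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str) -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as '-' to match the convention assumed
    for this record; unrecognised codons are written as 'X'.
    """
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        residues.append("-" if aa == "*" else aa)
    return "".join(residues)


def coding_length_aa(transcript_bp: int) -> int:
    """Residue count for a transcript that ends in one stop codon."""
    return transcript_bp // 3 - 1


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval."""
    return end - start + 1


# First five codons of DPOGS205653-TA followed by its stop codon (TGA).
print(translate("ATGAGTTCCATAGGTTGA"))
print(coding_length_aa(1185))
print(span_length(298051, 299235))
```

Passing the full DPOGS205653-TA sequence to `translate` and comparing the result with DPOGS205653-PA is a quick way to confirm the two entries agree. The genomic span equals the transcript length, which is what one would expect if the coding sequence is a single uninterrupted interval on the scaffold.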