Monarch geneset OGS2.0

DPOGS202882
Transcript        DPOGS202882-TA   1020 bp
Protein           DPOGS202882-PA   339 aa
Genomic position  DPSCF300126 - 408407-412654
RNAseq coverage   1668x (Rank: top 8%)
Annotation
Heliconius      HMEL004871        2e-79   55.15%
Bombyx          BGIBMGA004158-TA  2e-122  67.06%
Drosophila      vis-PB            6e-48   57.75%
EBI UniRef50    UniRef50_C0KWE9   8e-134  69.32%  ACHI protein n=8 Tax=Endopterygota RepID=C0KWE9_BOMMO
NCBI RefSeq     NP_001153660.1    2e-134  69.32%  achintya [Bombyx mori]
NCBI nr blastp  gi|237648964      3e-133  69.32%  achintya [Bombyx mori]
NCBI nr blastx  gi|237648964      3e-139  70.68%  achintya [Bombyx mori]
Group
Gene Ontology   GO:0003677  1.2e-20  DNA binding
                GO:0006355  1.2e-20  regulation of transcription, DNA-dependent
                GO:0005515  3.3e-16  protein binding
                GO:0005634  2e-15    nucleus
                GO:0043565  2.9e-11  sequence-specific DNA binding
                GO:0003700  2.9e-11  sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain  [92-156]   IPR012287  1.2e-20  Homeodomain-related
                 [75-148]   IPR009057  3.3e-16  Homeodomain-like
                 [109-148]  IPR008422  2e-15    Homeobox KN domain
                 [91-156]   IPR001356  2.9e-11  Homeobox
Orthology group  MCL18824  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202882-TA
ATGTTACCACCTACGGCCGGTATAGCACAAGATCAACGTGACCGGACCGACATGGCGTATCAACAACGACGCGACGTACGAGAAATTACAAGAGACATTATCAGAGAGGCTCGTGTAACAAACAGAAAACAATCTATGTCGCCTGGAGCACGTGATCGTTTCCCTTCCACGTCGTCAACGGACGACAATGAGTCGGGTACGGATGGAGAGCTCGAGAGACGGAGGGTCTCACATGGTGCACACATCGTACAGCCACCTGGTGGCATCATAGTCCGGAAGCGAAGAGGGAACCTTCCGAAACATTCAGTGAAGATATTAAAGCGTTGGCTGTATGAACACCGCTACAATGCTTATCCCAGCGATGCTGAGAAACTAACCCTCAGTCAGGAAGCTAACCTCACAGTGCTACAGGTATGCAACTGGTTCATAAACGCCCGTCGTCGCATCCTCCCGGAGATGATCCGTCGCGAGGGTCACGACCCCTTGCACTACACCATCTCCCGCCGAGGAAGGAAGCAGTTGGGTGCAGGGTCTAGCTCACAGGCCGCCGCGTGGGACGGGGAGGGCGAACACGAGGGAGAGGGCGAGGGCGATGGAGACCGCGGCCGCGATCACGACTACGGCGACGGTCTGCTGGTGTACCGCAGCGACGGAGACGAGGCGGCGGACGCCGAGGAGGGCTACTCCTCCAGCGCCATCTCCGAGGAGGAGGTCAAGTACGACCCCTCCGTGTGGCAGTCCGTCATACGGTACGGGCCCGAGGATAGGGACCTGCATCCCGCGGCCAGAGGATCAGCGGCGGTGGTGACAGTGCGGACAGCGGAGGACGGGGAGGCGGAGGTGTCCCCGGGGGGCGGCGCCGGGGGTTGGCACCCTCAGCTGGCTCATCTGCCGCAGCTGCCTCACCCGCTGCGCCGCAAGACTGACGAACGTGACAAGTTTAAATGCCTGTACCTCCTCGTGGAGACGGCCGTGGCGGTTCGTCAGAGGGAACAGGAACAGGAGGAGGTCCCCGTCTAG

Protein sequence:

>DPOGS202882-PA
MLPPTAGIAQDQRDRTDMAYQQRRDVREITRDIIREARVTNRKQSMSPGARDRFPSTSSTDDNESGTDGELERRRVSHGAHIVQPPGGIIVRKRRGNLPKHSVKILKRWLYEHRYNAYPSDAEKLTLSQEANLTVLQVCNWFINARRRILPEMIRREGHDPLHYTISRRGRKQLGAGSSSQAAAWDGEGEHEGEGEGDGDRGRDHDYGDGLLVYRSDGDEAADAEEGYSSSAISEEEVKYDPSVWQSVIRYGPEDRDLHPAARGSAAVVTVRTAEDGEAEVSPGGGAGGWHPQLAHLPQLPHPLRRKTDERDKFKCLYLLVETAVAVRQREQEQEEVPV-
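
The transcript and protein records above can be checked against each other: 339 residues plus one stop codon correspond to 340 codons, or 1020 bp. The following is a minimal sketch of that check, assuming the standard genetic code and a coding sequence that starts in frame at the first base. The `translate` helper and its `stop_symbol` parameter are illustrative names, not part of the database; the stop is rendered as `-` only to match the way this record writes it.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stops are shown as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


# First 21 and last 18 nucleotides of DPOGS202882-TA, copied from the record.
print(translate("ATGTTACCACCTACGGCCGGT"))  # MLPPTAG
print(translate("GAGGAGGTCCCCGTCTAG"))     # EEVPV-

# Length consistency: (339 aa + 1 stop codon) * 3 nt per codon.
print((339 + 1) * 3)                       # 1020
```

Both fragments agree with the start (`MLPPTAG`) and end (`EEVPV-`) of DPOGS202882-PA. Passing the full nucleotide sequence to the same function would be the complete check.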