Monarch geneset OGS2.0

DPOGS201649
Transcript: DPOGS201649-TA  1176 bp
Protein: DPOGS201649-PA  391 aa
Genomic position: DPSCF300254 + 69938-75747
RNAseq coverage: 342x (Rank: top 34%)
Annotation
Heliconius: HMEL015659  2e-135  82.64%
Bombyx: BGIBMGA008198-TA  3e-127  75.22%
Drosophila: oc-PC  5e-33  98.39%
EBI UniRef50: UniRef50_D5MTK0  6e-157  77.89%  Orthodenticle homologue n=1 Tax=Bombyx mori RepID=D5MTK0_BOMMO
NCBI RefSeq: NP_001171667.1  1e-157  77.89%  orthodenticle [Bombyx mori]
NCBI nr blastp: gi|296080742  2e-156  77.89%  orthodenticle [Bombyx mori]
NCBI nr blastx: gi|296080742  3e-173  79.40%  orthodenticle [Bombyx mori]
Group
Gene Ontology:
  GO:0006355  1.4e-23  regulation of transcription, DNA-dependent
  GO:0043565  1.4e-23  sequence-specific DNA binding
  GO:0003700  1.4e-23  sequence-specific DNA binding transcription factor activity
  GO:0005515  2.5e-23  protein binding
  GO:0003677  3.9e-23  DNA binding
KEGG pathway 
InterPro domain:
  [12-74]  IPR001356  1.4e-23  Homeobox
  [3-81]  IPR009057  2.5e-23  Homeodomain-like
  [9-77]  IPR012287  3.9e-23  Homeodomain-related
Orthology group: MCL25446  Lepidoptera specific

Nucleotide sequence:

>DPOGS201649-TA
ATGAAATACTTAAACTGTGTGAATCCTCGTAAGCAGCGGAGGGAAAGGACGACCTTCACTCGCGCTCAGCTGGATGTTCTGGAAGCTCTGTTCGGGAAGACGAGGTACCCGGACATCTTCATGAGGGAGGAGGTCGCGCTCAAGATAAACCTGCCAGAGTCCAGGGTTCAGGTCTGGTTTAAGAACCGTCGCGCCAAATGTAGACAACAGCTCCAACAACAGACGAACTCGCAGATCAGCAGCAAATCTTCTTCATCATCAAAGACGATATCCGCAGCCAGCTCAATGAAGAGCTCCTCCAACAAAATCTCCTCAACGAAACCCTTAACCTCCTCCAACTCCTTGACCTCCACGCAGAACAGCATAACCACATCTTCCCCCAACCTCCCCACCCCCACCACATCCGTCAGCCCCCCCATCAACGTCATATGCAAGAAAGAATCCGCGCCCAGCTACGACTCGTCCCCGCTAAACAAAAACCTCACATCCATCAACCATTACACCAACGATCTCCGGACCAAAGATTCCTCTCCATCCGAGGTGAACGTCCACAATTACGGAGTCACGCCCAAACAAGAGCTATACAACAGTCCCAAAAGACTTGAATACAATAAAGTGGACTATACAGAGAAATTCGGTGGGAATCTGGTCAAAGAGCAGTACAACTCGCGCCTGACGGGCAACCTCACACCGCTGGGGTCCAATTCCTCTATAATGACGACCCCATCACCCCCAATCACCCCCCAGAGCATCAACGGGCCGGGGAACGTTTATCACCCCGACAGCTACAACAGCTTCCACTGGCCCGGCAGCTCGGACTACATCCGCAGTTACTCCGGCACACATCACCAGGGATACGGACAGAGTTACAACTCGCCTTACTATCCCTCACAGATGGAATACTTCAACGGGTCATCAGTGAACCAGGCTCACGGGAGTCACCACGGTCACAACCTGTCAGCGAACGGCAACCAGCTGTACAACCAGGTGTCGTTCGGCAACCCGTCCCCGCCGGCCGCCTGGAGCTCACACAACATACTCTCGTCGGGAGCCGAGTCCCCCGACAACCAAGCCCAGAACCACCTCAGCATGATCCAAGCATCACAAATAACGAACTTCCCCAACACCAGCAATAGTTACTTCGCGCCGGAGAAATACGGCATCAATATGGTCTAG

Protein sequence:

>DPOGS201649-PA
MKYLNCVNPRKQRRERTTFTRAQLDVLEALFGKTRYPDIFMREEVALKINLPESRVQVWFKNRRAKCRQQLQQQTNSQISSKSSSSSKTISAASSMKSSSNKISSTKPLTSSNSLTSTQNSITTSSPNLPTPTTSVSPPINVICKKESAPSYDSSPLNKNLTSINHYTNDLRTKDSSPSEVNVHNYGVTPKQELYNSPKRLEYNKVDYTEKFGGNLVKEQYNSRLTGNLTPLGSNSSIMTTPSPPITPQSINGPGNVYHPDSYNSFHWPGSSDYIRSYSGTHHQGYGQSYNSPYYPSQMEYFNGSSVNQAHGSHHGHNLSANGNQLYNQVSFGNPSPPAAWSSHNILSSGAESPDNQAQNHLSMIQASQITNFPNTSNSYFAPEKYGINMV-