Monarch geneset OGS2.0

DPOGS210998
TranscriptDPOGS210998-TA1077 bp
ProteinDPOGS210998-PA358 aa
Genomic positionDPSCF300004 + 355272-356638
RNAseq coverage86x (Rank: top 63%)
Annotation
HeliconiusHMEL0250494e-2065.08% 
BombyxBGIBMGA006478-TA4e-1343.06% 
Drosophilaeve-PA3e-1348.44% 
EBI UniRef50UniRef50_Q1L9751e-1144.58%Even-skipped homeobox 1 n=7 Tax=Clupeocephala RepID=Q1L975_DANRE
NCBI RefSeqXP_001969116.17e-1248.44%eve [Drosophila erecta]
NCBI nr blastpgi|2135121041e-1147.89%even-skipped homeobox 2a [Salmo salar]
NCBI nr blastxgi|2135121046e-1241.18%even-skipped homeobox 2a [Salmo salar]
Group
Gene OntologyGO:00036773e-19DNA binding
GO:00063553e-19regulation of transcription, DNA-dependent
GO:00435654.8e-19sequence-specific DNA binding
GO:00037004.8e-19sequence-specific DNA binding transcription factor activity
GO:00055153.4e-18protein binding
KEGG pathway 
InterPro domain[32-99] IPR0122873e-19Homeodomain-related
[41-103] IPR0013564.8e-19Homeobox
[28-98] IPR0090573.4e-18Homeodomain-like
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210998-TA
ATGAAATATTCAGTTTTAAAGCAATTGGCTAATGAGAGGCAGTTGAAGACAAATGACAAACCAGCTACAGTTCCACGTGAGGATGTACAGGGTTTATCGACACGTTATCATGAGAAGAAAACTCGAAGATTTCGTAGTGCATTCACAACTGATCAAATTAAATATTTGGAAAAACAATTTCAAAAATTTCCATATATCGAAAGTGGTAGTCGAAGGGCGATTGCTAGTGTATTAAACATACCTGAGAGGACTGTAAAGTTTTGGTTTCAAAATCGAAGAATGAAAGAGAAGAAGGAGTCTCTTATTAAGGACCTAGTCGGTGAGGGACAAAGCAAAATTAATGGAAATCTTACTGATGACAAGATGGACGTCAAGACTTTTAATTACCAATCCCATTCTGGCCATTTGTCATATCAATTGAAACCAATATTAGTATGTCCAGATAAAACGGATAGCAGTTCCAGCAATGTTATAAATTGTTCTAAGGAATTTTCTAAATCAGTTCATGAAAAGCAGAGTATTGCAGGGAGGAATTTGGAAAACGTGAAAATATTACAATCGAACCTTATTAAAGCCGAACCGACTCACATAAATGGCACTACAAATGTCAACTTGTTACGAAAAAATATATGTGGAACAAAATTACCAACAAGGAATATGCCAGTGAACCAGAAGCTAAAAAATATAAAAGCGAAAGACAGAAACCCTATGGACTATAAATTAAAATCTTCTTCAAATCCAGAAGTTTGCCGACAGTATTATTATCAAATACCAAACGATCAAAGCTATGCGGCATTTCGGGTAAATCCATTAATCCAACATCCATCTGCACCAGGTAACATTGTTTGGAGACCTGTGAACACATTGTCTGTGGTACCCAACATAATAAATCCGTCGCAAATGATGGCCTATGATAACAGATATTTGCCACTCAAACAGAATCTTCCGAAAGATCAGTGCAAATGTAATTGTAAAATTAATAATACACAGGAAATGTCCCCTTATATACCTTACAATGTATCAACTGTACCATCAAAATATTTATTGACTATGCCTTTTAGTAATAATTTTGAATAA

Protein sequence:

>DPOGS210998-PA
MKYSVLKQLANERQLKTNDKPATVPREDVQGLSTRYHEKKTRRFRSAFTTDQIKYLEKQFQKFPYIESGSRRAIASVLNIPERTVKFWFQNRRMKEKKESLIKDLVGEGQSKINGNLTDDKMDVKTFNYQSHSGHLSYQLKPILVCPDKTDSSSSNVINCSKEFSKSVHEKQSIAGRNLENVKILQSNLIKAEPTHINGTTNVNLLRKNICGTKLPTRNMPVNQKLKNIKAKDRNPMDYKLKSSSNPEVCRQYYYQIPNDQSYAAFRVNPLIQHPSAPGNIVWRPVNTLSVVPNIINPSQMMAYDNRYLPLKQNLPKDQCKCNCKINNTQEMSPYIPYNVSTVPSKYLLTMPFSNNFE-