Monarch geneset OGS2.0

DPOGS202775
TranscriptDPOGS202775-TA1764 bp
ProteinDPOGS202775-PA587 aa
Genomic positionDPSCF300018 - 1012265-1027328
RNAseq coverage1706x (Rank: top 7%)
Annotation
HeliconiusHMEL0026793e-8153.28% 
BombyxBGIBMGA010496-TA1e-10155.28% 
Drosophiladl-PC6e-10461.46% 
EBI UniRef50UniRef50_G3LF423e-14451.54%Dorsal n=1 Tax=Helicoverpa armigera RepID=G3LF42_HELAM
NCBI RefSeqNP_001036896.12e-12845.83%embryonic polarity protein dorsal isoform B [Bombyx mori]
NCBI nr blastpgi|3469877651e-14351.54%dorsal [Helicoverpa armigera]
NCBI nr blastxgi|3469877651e-14253.15%dorsal [Helicoverpa armigera]
Group
Gene OntologyGO:00063553.9e-70regulation of transcription, DNA-dependent
GO:00037003.9e-70sequence-specific DNA binding transcription factor activity
GO:00056341.2e-67nucleus
GO:00055154.5e-10protein binding
KEGG pathway 
InterPro domain[55-241] IPR0089673.9e-70p53-like transcription factor, DNA-binding
[57-232] IPR0115391.2e-67Rel homology
[234-371] IPR0147562.4e-36Immunoglobulin E-set
[234-348] IPR0137836.4e-34Immunoglobulin-like fold
[62-79] IPR0004514.2e-29NF-kappa-B/Rel/dorsal
[236-335] IPR0029094.5e-10Cell surface receptor IPT/TIG
Orthology groupMCL10541 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202775-TA
ATGGACAACGGCGGAGAGGGCGTGCTGCATGTAGACCACAGTGAGGCGGGCCAGCCGTCCAACCTGAATATAAGTGATGTCATTGAGGCGATCACGAAGGCGGACCCGCTGTTCGGGCCGGGAGTAGAGGCAATGCCGCGACCTCTGACCTCCCACCCGGGGCAGACCTACGTCCGCATAGTGGAGCAACCCGCCGGGAAGGCGCTCAGGTTCCGGTACGAGTGTGAGGGTCGGTCGGCGGGCTCCATCCCCGGGGTGAACAGCACCCTCGAGAGGAAGACCTACCCCACCATAGAGATCGTGGGCTACAAGGGAGATGCCGTCGTCGTGGTGTCCTGCGTCACCAGGGAACAGCCCTACAGACCTCACCCCCACAACCTGGTGGGTCGCGAGAGGTCATGCGAGAACGGAGTGTGCACCGTGAAGAGGAGCATCAGCGAGGAGTCGCCACAGGTGTCCTTCAGTAACCTGGGGATACAATGTGTCAAGAGGAAGGACATCGCCGAGGCGCTCAAGACGAGAGAGAGGCTCCGGGTGGACCCTTTCAAGACCGGTTTCGGTCACCGCAACAAGCCGCAGAGCATCGACCTGAACACTGTGAGGCTCTGCTTCCAAGTGTTCCTCCCTGACGAGCGGACAGGCAAGATCAAGCACTCGCTGCCGCCCGTCGTGTCCGACGTCATCTACGACAAGAAGGCCATGAGCGACCTGGTCATCATGAGGGTCAGCCAGTGCTCGGACTTCGTCAAGGGAGGCGCCGAGATCATCCTGCTGTGCGAGCGAGTGACGCGCGAGGACATCTCCGTGGTGTTCTTCCAAAAGGAGGGCGACAACGTGGTTTGGGAGGAAAGCGCGCACATCGTGCTCGTCCACAGGCAGTACGCCATCGCTTTCCACACGCCGCCCTACAGAGACCAGGCGGAGACAGGACACGTGCAGGTGTATCTCCAGCTGAAGCGGATCTCGGACAACGCCCGCAGCAACGCCGTGCCGTTCGAGTACATCCCGGAATACCAAGATACGAACTATTTAAAGCATAAGAGGTTGAAGAAGTTACCTTCGGTGTTGCATACTTCCTACGACACAGACAGAAGCTACCAAGGAGATCAGAAGATTAAAGCGGAACCCAGAGACAAAACTCCGCCTCACCCCGCGGCCAGCCCGCTGCAAGTGTTCTCGCCACACTACGAGCAGGAACAGATGCAGCAAGACCACTACCAGCAACAGGCCTGGGGAATGCAAGGCGGATTAAACATGGCCGGGCCGAGCCATATTAACTATGGTCAGGACTTGCAGTGGAGTCCGAATTACGTCCAACTGGGTTCCAACCTCCAGCCTCTGTCACCGAATATGACGACCCTGACCTCCAACATGCAACAGATGTCAAGCCTGCAACCTCTGTCGCCGAATATGCAGAGGATGTCGCCGAATATGCAGGCCATGTCTCCCAATATGCAAGCGATGTCTCCGAACATGCAAGCACTATCGCCCTTATACGTCCGGTACAGACCCAACATGCAAGCAATGTCTCCGAACATGCAAACGATGTCTCCAAGCATGCACGGCATGTCTCCTAACCTGCAAGCGATGAACACCAACATGCAGATGGGTATGGCGCCGCTGCTGGAGTCGCCGCTGGGAGAGCCTCTGACGTCTTCGGAGCTGTCCGGACTGGCCGCTCTGCTGGACCGAGGACCCGACCTCAGCGACAGCCTCAACCGCCTCTCCACCGGGGACCTCTACCCCATCTGCAGCGGAGACTAG

Protein sequence:

>DPOGS202775-PA
MDNGGEGVLHVDHSEAGQPSNLNISDVIEAITKADPLFGPGVEAMPRPLTSHPGQTYVRIVEQPAGKALRFRYECEGRSAGSIPGVNSTLERKTYPTIEIVGYKGDAVVVVSCVTREQPYRPHPHNLVGRERSCENGVCTVKRSISEESPQVSFSNLGIQCVKRKDIAEALKTRERLRVDPFKTGFGHRNKPQSIDLNTVRLCFQVFLPDERTGKIKHSLPPVVSDVIYDKKAMSDLVIMRVSQCSDFVKGGAEIILLCERVTREDISVVFFQKEGDNVVWEESAHIVLVHRQYAIAFHTPPYRDQAETGHVQVYLQLKRISDNARSNAVPFEYIPEYQDTNYLKHKRLKKLPSVLHTSYDTDRSYQGDQKIKAEPRDKTPPHPAASPLQVFSPHYEQEQMQQDHYQQQAWGMQGGLNMAGPSHINYGQDLQWSPNYVQLGSNLQPLSPNMTTLTSNMQQMSSLQPLSPNMQRMSPNMQAMSPNMQAMSPNMQALSPLYVRYRPNMQAMSPNMQTMSPSMHGMSPNLQAMNTNMQMGMAPLLESPLGEPLTSSELSGLAALLDRGPDLSDSLNRLSTGDLYPICSGD-