Monarch geneset OGS2.0

DPOGS208891
Transcript        DPOGS208891-TA  2835 bp
Protein           DPOGS208891-PA  944 aa
Genomic position  DPSCF300009 - 922604-939501
RNAseq coverage   480x (Rank: top 26%)
Annotation
Heliconius        HMEL008894        0.0    58.73%
Bombyx            BGIBMGA002464-TA  3e-51  40.83%
Drosophila        Rel-PC            7e-59  41.89%
EBI UniRef50      UniRef50_G3LF45   0.0    51.51%  Relish n=4 Tax=Obtectomera RepID=G3LF45_HELAM
NCBI RefSeq       NP_001095935.1    0.0    46.61%  nuclear factor NF-kappa-B p110 subunit isoform 1 [Bombyx mori]
NCBI nr blastp    gi|346987771      0.0    51.51%  relish [Helicoverpa armigera]
NCBI nr blastx    gi|346987771      0.0    51.83%  relish [Helicoverpa armigera]
Group
Gene Ontology     GO:0006355  6.3e-50  regulation of transcription, DNA-dependent
                  GO:0003700  6.3e-50  sequence-specific DNA binding transcription factor activity
                  GO:0005634  8e-43    nucleus
                  GO:0005515  1.8e-10  protein binding
KEGG pathway 
InterPro domain   [54-237]   IPR008967  6.3e-50  p53-like transcription factor, DNA-binding
                  [592-814]  IPR020683  1.8e-47  Ankyrin repeat-containing domain
                  [55-228]   IPR011539  8e-43    Rel homology
                  [232-346]  IPR013783  3.1e-36  Immunoglobulin-like fold
                  [232-345]  IPR014756  6.7e-33  Immunoglobulin E-set
                  [60-77]    IPR000451  1.3e-14  NF-kappa-B/Rel/dorsal
                  [234-336]  IPR002909  1.8e-10  Cell surface receptor IPT/TIG
                  [853-936]  IPR011029  2.8e-09  DEATH-like
Orthology group   MCL12230  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS208891-TA
ATGTCTACAAGTGAGCAAGATGTAAGCGATTCCAATTTGGAATCGCCGTTTTCGCAATCTGATTCACCGTACAGTTCGCCGTCTCATCAGGTTCCACAGCTAGCCAACATAATTTCAGACTTATCATGTGCCGAAAATACAAATATGCCAAAAGGCAATATGCCATATCTCAGTATTGTGGAACAACCGCAGAACCACTTTAGATTCCGATACAAAAGTGAAATGATTGGAACGCATGGTTGTCTGTTGGGAAAGTCCTATGCCACAAGTAAATCCAAATCACATCCAACTGTTGAGCTAAGAAACTATACTGGGAGGGCGCTTGTAAGATGTAGGTTAGCTAGGCATGATTCTACTGATGAACATCCTCATAAATTACTAGAGGAAGATCAGGATAGAGACGTCCATAGCTGGTTACCGGAAAAAGGAAGCTATCGAGTTGCATTTAGGGGAATGGGTATAATACACACGGCGAAAAAAGATGTTCCCGCACTATTATACAAAAGATACGCCTCAGAGAAACCAGAAACGGCCTTTAATGAAAATAAGCTAAGATTGAAATGCGAGAACGAGGCGAAGAATATTAATTTGAATATAGTCAGATTGAAGTTCAGCGCTCACGATCCAAACACCGATGCTGAAATTTGTCCTCCAGTTTACTCAGAGTGGATACATAATATGAAGAGTGCAGCAACAAATGATCTTAAGATCTGTCGTATGAGCCGCTGTTACGGCCGACCGAAAGGCAAAGAGGACGTCTTCATATTTGTTGAAAAAGTAAATAAGAAAAACATAATGATAAAATTCTTCGAACTGGACAAATATGGAGAAAGGGTTTGGTCTAAAATGGCGACTTTCTTGCAGAGCGACGTACACCACCAATACGGCATTGTTTTCAGGACACCAGAATACCATAACTTACATATTACGTCAGATGTGAAGGTATTCATAGAATTGGTTAGACCGAGCGACGGCAGGACCAGCGAACCAAAGGAGTTCACATACAAAGCTGAAACTATATATATACAGAATAAAAAAAGGAAAGCTAACTCATCATTCTCCTCTATAGGAAGTTCAGGTAGTTCAATAAAAAGTGTCAGTGACCTGCCAACAACTGTAGAGTATGCTAATCAATTCGGAGAATTTAATAATATGACTAACAATGGCATTGAAGTAATGGACCATCCGCCGGTTGCGCCGATGTACCATTTCAAAGTACCCCAAAATCAAATAGCCCCAACAGAGGAGTCGATGATGGCGGATGCTCTATTGCACTCGTATGTGGGTTCCAATCCCGTACAGTCCAAGGCTATGAGTCCTCTCCTCAGTCAGCCGAACGTGCCCGAACCCCCCGTACTACAGCTCCACTCGTCTGAATTGGACCGAGTTCTTGAACAAGACATTGATTTACTTAGTGAAGACAAAAAACGATTTTTCAGTACTGATCTCGGAGATTACTTCGAACAATATGACGGCGTTGAGCGCAGCACGATGGAATGGATCAAGTCATCAATGATGGTCGCCGATTCTGCAAAACGAAGCGAAAAGAACACAAAGTACAAGGAATCATTGAAGTCGGAGGTAAAAGATGAACCGAAAAGCAATGTTATAAGAAAACCACCAAGTGAATACTCCGCGCTCTATAAGGCGGAAGACGGAGTCGAAGTTCAGAAACTCATCAAGGAACTCTGTGAAATGATGAGAAACAAGGACGGTATCAAGAAGCAGGATGTGAGAAGCAAGTTGGACAGGCTCTTGGAGATGAGACTGTCCAATGGTGACACTTTTCTCCACCTGACCCTGAGTAGTAATCAGCCGAGCTTGGAATATATTGTGAAGTTGATTTATAACATGAAGATGACTAAGTTATTGAACTTGAAGAACAACCAAATGCAGACTGTACTGCATCTGGCTATTATCAATGATTCGCCGAAATTAGTCTCTTTCCTTATATCGAAAGGTTGTGATCCCATGGAGGAAGACAATGAAGGGAATAATGC
ACTGCATTATGCTGTCATATGTCAGACTTGCCTCGGACCGCTATTGGAATCCATCAAGAGTAATAACATCAGTTACGATATAAACGCTTACAACAATGAGAAACAGACAGCTTTACATCTATCGTCGGTGTACGGTTGTCGTAAGAGCGCGACCTTGTTGTTGTCTGCTGGCGCGAAATGGGACGCTCGCGATGGAGACGGCCGTACAGCCCTCCACCTCGCCGTACTCGACGACTGTTTACCAGTCGCAAATGAACTGCTAGAGAAGCCGGTGGATGTAGACGCCTTGGATGGAAAAGGCTACACAGCGCTACAGATAGCCTGCGATAGTGTGATCAGAGAAAACACCTTAGAGATTGTGAAACTTTTACTAGACAAGAAGGCTGATCCCCTGAAACACGAAGAAAACAACCATTCCGCGTGGAGGCTGGCGCGAGATAAGCCACAACTAGCAAAACTTATGAGAAAATATATAAGTTCAGAAAAAAATATAGAAATTGATATTAAATCTGAGCCGGAGGACGACGACGACTCGGAAGAAAAGGATACAGAATCGGATCTTACAGCATTACCAATGTATATTGATCGGGTTGCAGTCTTACTGGATAATAATGGAGGCTGGAAGGAATTAATGAGAAGGCTTGATAAGGATTCCTATTACTCCTGGTATAAAGCCACTGACAGTCCCGCAAGAACACTCTTGAACCATCTTAAGGGTTGGAATGAGGGTGTGTCTTCAAAGTCTTTGGCATTGTTGTTGGATGACATCGGACAACATGAAGCAGCAACTATTATTAAAGCCTGTATTGATGAACCAAATAAAAACCTTAAGTGA

Protein sequence:

>DPOGS208891-PA
MSTSEQDVSDSNLESPFSQSDSPYSSPSHQVPQLANIISDLSCAENTNMPKGNMPYLSIVEQPQNHFRFRYKSEMIGTHGCLLGKSYATSKSKSHPTVELRNYTGRALVRCRLARHDSTDEHPHKLLEEDQDRDVHSWLPEKGSYRVAFRGMGIIHTAKKDVPALLYKRYASEKPETAFNENKLRLKCENEAKNINLNIVRLKFSAHDPNTDAEICPPVYSEWIHNMKSAATNDLKICRMSRCYGRPKGKEDVFIFVEKVNKKNIMIKFFELDKYGERVWSKMATFLQSDVHHQYGIVFRTPEYHNLHITSDVKVFIELVRPSDGRTSEPKEFTYKAETIYIQNKKRKANSSFSSIGSSGSSIKSVSDLPTTVEYANQFGEFNNMTNNGIEVMDHPPVAPMYHFKVPQNQIAPTEESMMADALLHSYVGSNPVQSKAMSPLLSQPNVPEPPVLQLHSSELDRVLEQDIDLLSEDKKRFFSTDLGDYFEQYDGVERSTMEWIKSSMMVADSAKRSEKNTKYKESLKSEVKDEPKSNVIRKPPSEYSALYKAEDGVEVQKLIKELCEMMRNKDGIKKQDVRSKLDRLLEMRLSNGDTFLHLTLSSNQPSLEYIVKLIYNMKMTKLLNLKNNQMQTVLHLAIINDSPKLVSFLISKGCDPMEEDNEGNNALHYAVICQTCLGPLLESIKSNNISYDINAYNNEKQTALHLSSVYGCRKSATLLLSAGAKWDARDGDGRTALHLAVLDDCLPVANELLEKPVDVDALDGKGYTALQIACDSVIRENTLEIVKLLLDKKADPLKHEENNHSAWRLARDKPQLAKLMRKYISSEKNIEIDIKSEPEDDDDSEEKDTESDLTALPMYIDRVAVLLDNNGGWKELMRRLDKDSYYSWYKATDSPARTLLNHLKGWNEGVSSKSLALLLDDIGQHEAATIIKACIDEPNKNLK-
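
The lengths and coordinates listed in the record can be cross-checked against each other. The sketch below is only an illustration, not part of the database record: the function names are hypothetical, and it assumes that the transcript is a complete coding sequence (start codon through stop codon, no UTRs) and that the genomic coordinates are 1-based and inclusive.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 2835
PROTEIN_AA = 944
GENOMIC_START, GENOMIC_END = 922604, 939501


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon.

    Assumes the sequence runs from the start codon through the stop codon.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


print(TRANSCRIPT_BP // 3)                                    # 945 codons
print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)  # True
print(genomic_span(GENOMIC_START, GENOMIC_END))              # 16898
```

Under these assumptions, 2835 bp gives 945 codons, i.e. 944 residues plus the stop codon (written as the trailing "-" in the protein sequence), and the gene spans 16898 bp on the scaffold, which is consistent with a multi-exon gene.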