Monarch geneset OGS2.0

DPOGS203203
TranscriptDPOGS203203-TA1836 bp
ProteinDPOGS203203-PA611 aa
Genomic positionDPSCF300035 + 635838-639490
RNAseq coverage184x (Rank: top 49%)
Annotation
HeliconiusHMEL0157510.085.59% 
BombyxBGIBMGA011089-TA0.080.50% 
DrosophilaScm-PA5e-14151.81% 
EBI UniRef50UniRef50_E3UKP80.082.24%Sex comb on midleg n=1 Tax=Biston betularia RepID=E3UKP8_9NEOP
NCBI RefSeqXP_966529.21e-17654.67%PREDICTED: similar to lethal(3)malignant brain tumor [Tribolium castaneum]
NCBI nr blastpgi|3085128070.082.24%sex comb on midleg [Biston betularia]
NCBI nr blastxgi|3085128070.082.47%sex comb on midleg [Biston betularia]
Group
Gene OntologyGO:00056341.6e-49nucleus
GO:00063551.6e-49regulation of transcription, DNA-dependent
GO:00055153.2e-17protein binding
GO:00082706.1e-08zinc ion binding
KEGG pathway 
InterPro domain[240-341] IPR0040921.6e-49Mbt repeat
[533-608] IPR0137612e-26Sterile alpha motif-type
[541-603] IPR0109933.2e-17Sterile alpha motif homology
[540-604] IPR0211291.5e-13Sterile alpha motif, type 1
[538-606] IPR0016605.8e-10Sterile alpha motif domain
[23-59] IPR0105076.1e-08Zinc finger, MYM-type
Orthology groupMCL12119 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203203-TA
ATGTCAAACAATACTGTCGGCTCGTCAGGGCATGCTCCCGGCAAAATACGTGGACCTGGAAGACCCCCTAAACGGACGTGCACTTGGTGTGCTGAGAGTAAAACGCCTTTGAAGTATGTTTTGCCTACGGAAAATGGGAAAAAAGAATTTTGTTCAGAAACGTGTTTGTCAGAATTCCGACAAGCATATAGTAAAGGCGCTTGTCTTCACTGTGACAATGTTATACGCGGCAATGCTCCATCCAGTAGCAAAAATTTTTGTTCGACGTATTGTTTAAATAAATATCAAAAGAAAAATGAGAAGAGAACAACTTCGCCGCAATCAGGGAATGGAGCGAACGGTACCGAACCTCATCAGAACAATAATTCGACAGGATCGTTCTATGACATATACCAGTCGTTTGATTGGAATGAATATATGAAGGAAACTAATAGTGTTGCTGCACCCCAAGAGTGTTTTAAGCAAGCTCCAAATCCTCCAGTGAATGACTTTAAAGTTAATATGAAACTCGAAGCTTTGGACCCTCGTAATTTGACATCAACTTGCATTGCTACAGTTGTTGGTGTATTGGGTCCGAGATTGAGGCTTAGACTTGATGGCAGTGATAATAAAAATGATTTTTGGAGGCTTGTTGATGCTGGTGATATTCATCCTATAGGTTATTGTGAGAAAAATGATGGTATGTTGCAACCACCTCTTGGTTTTCGTATGAATGCCAGCAGTTGGCCTATGTTCTTGCTGAAAACATTAAATGGGGCGGAGATGGCTCCATCAAAGGTCTTTCAACCTGAACCACCTACTCCTAAATCAAATTTGTTTGTTGTTGGTCAAAAATTAGAAGCTGTTGATAAAAAAAATCCACAACTTATATGTTGTGCAACTGTTGGTGCCGTGAAAAATGATCAGATACATGTTACTTTTGATGGTTGGAGGGGGGCCTTTGATTACTGGTGTAAATATGACTCTCGAGACATATTTCCTGTTGGCTGGTGTGCAAGAGCAGGTCACTTATTACAGCCACCTGGTCAAAAAAGTGCTACAGCGCCTTCTAGATTTAAATTGCGTCCCAGTGGTATTCCTAATCCAGCTTTACCAGAAGGGGGATCAACTGGTACAGGCAATGCAAATGGAGCTAACACTGTAACTCCATTAGCAAATGTTGTTTTACGTATCCGTAATAGTTGTTCTGGAGGGAACGCCGCCTTACCATCCTCCATCACCGGTGTCGGTGCTTCAGGTGTAGCTGAAAACCTAGTTAAAGAGTTACTTGTTACATATACAGATCCTCAAAAACTTACGAGGGCAATACTTACTGCATCAAATAGCTATTCAAATAATACTAATGTACAGGTATCAGTTGGCAATAAGAATTACCCAGTTAAAGTGCCTCAGGAATTATCAACCGAAGACCTAAAGAATTGGTTAAAATTAGTGTGTAATGGTATCGGGTGCTGTGTGGGAATGATAGAAATAGATACAGGTGAAGCCGCAGGACGGCCTTGTACACTGTGTGGTGAATCTACCTCAAATACTGTCACATCTGCAGTTAAAAGACCGAAAAGTGTAAGTAAATTATGTTTTATTCAACAATCCCCTGCCCCCGCCGCCGCCCCCGCTGACTGGTCCGTGGAGGATGTCATCGGATTTATCGCTGCAGCTGACCAAGCACTTGCCGCCCATGCTGATCTATTCAGAAAGCATGAAATAGATGGCAAGGCGCTCTTGTTATTGAATTCTGACATGATGATGAAATACATGGGTCTGAAACTTGGTCCCGCCTTAAAAATATGCAATTTGGTATCTAAAATAAAAAATCGTCGACATTATAGCACCTAG

Protein sequence:

>DPOGS203203-PA
MSNNTVGSSGHAPGKIRGPGRPPKRTCTWCAESKTPLKYVLPTENGKKEFCSETCLSEFRQAYSKGACLHCDNVIRGNAPSSSKNFCSTYCLNKYQKKNEKRTTSPQSGNGANGTEPHQNNNSTGSFYDIYQSFDWNEYMKETNSVAAPQECFKQAPNPPVNDFKVNMKLEALDPRNLTSTCIATVVGVLGPRLRLRLDGSDNKNDFWRLVDAGDIHPIGYCEKNDGMLQPPLGFRMNASSWPMFLLKTLNGAEMAPSKVFQPEPPTPKSNLFVVGQKLEAVDKKNPQLICCATVGAVKNDQIHVTFDGWRGAFDYWCKYDSRDIFPVGWCARAGHLLQPPGQKSATAPSRFKLRPSGIPNPALPEGGSTGTGNANGANTVTPLANVVLRIRNSCSGGNAALPSSITGVGASGVAENLVKELLVTYTDPQKLTRAILTASNSYSNNTNVQVSVGNKNYPVKVPQELSTEDLKNWLKLVCNGIGCCVGMIEIDTGEAAGRPCTLCGESTSNTVTSAVKRPKSVSKLCFIQQSPAPAAAPADWSVEDVIGFIAAADQALAAHADLFRKHEIDGKALLLLNSDMMMKYMGLKLGPALKICNLVSKIKNRRHYST-