Monarch geneset OGS2.0

DPOGS200990
TranscriptDPOGS200990-TA882 bp
ProteinDPOGS200990-PA293 aa
Genomic positionDPSCF300147 - 272261-285965
RNAseq coverage732x (Rank: top 18%)
Annotation
HeliconiusHMEL0130561e-8782.23% 
BombyxBGIBMGA014212-TA4e-9580.63% 
DrosophilaSnoo-PB2e-5448.21% 
EBI UniRef50UniRef50_D6WYN43e-6250.00%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WYN4_TRICA
NCBI RefSeqXP_966566.22e-6254.04%PREDICTED: similar to nuclear oncoprotein skia [Tribolium castaneum]
NCBI nr blastpgi|2700119871e-6150.00%hypothetical protein TcasGA2_TC006082 [Tribolium castaneum]
NCBI nr blastxgi|2420220716e-6158.74%transforming protein Ski, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00056348.3e-40nucleus
GO:00001666.3e-38nucleotide binding
GO:00054887.4e-26binding
KEGG pathway 
InterPro domain[18-213] IPR0232164.4e-94Transcription regulator SKI/SnoN
[20-131] IPR0033808.3e-40Transforming protein Ski
[26-131] IPR0090616.3e-38DNA binding domain, putative
[140-229] IPR0109197.4e-26SAND domain-like
[142-220] IPR0148901e-23c-SKI Smad4-binding domain
Orthology groupMCL14244 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200990-TA
ATGGCCGGGCCGCCGCTGGGAACGGGAGCGCGCTCGGCCGCCGCGCCGCCGCCGCCGCCGCCGCCCCGGCCCGCGCCGCCCATCCTGCTGGCCGCCGACCCGGTGCTGGGCGAGCGCGGAGAGACCGCCCTCGAGGGCGAGCCGATATCGTGCTTCAGCGTGGGCGGAGAGAGGCGCCTGTGCCTGCCACAGATCCTGACCTCGGTTCTAACGGACTTCACGCTGGAGCAGATCAACAGGGTGTGCGACGAGCTGCAAATCTACTGCTCGCGCTGCACGCCGGAGCAGCTGCACGAGCTCAAGTCGGAGGGCGTGCTGCCCCGCTCGGCGCCGTCGTGCGGCCTCATCACACAGACGGACGCGGAGCGCCTGTGCGCGGCGCTGCTGCACGCTCCGGCCTCCGCCCCTCGCCGCCTGGGGGGCTTCCGCGTGTATCACGAGTGTTTCGGGGGCGCGAGCGGGGTGTGCTCGCCGGAGCTGGGGCTGGTGCAGTGCGCGGAGTGCCGCGGAGTGTTCGCGCCTCGCCGCTTCGTGTGTCACTCTCACGGCACGGAACACCGCACGTGTCACTGGGGCTTCGACTCTTCCCGCTGGAGACGCTTCGTGCTGGTGTCGGAGGAGGAGCGGGACAAGGACGAGTGCGGCGCCCTGCTGGACGAGCTGGCCGCCAGGGAGGCCGCCGCGCCGGCACACGCGCCCCCTACTGTGGTGAAGCAGCACGCGCCGCTCAAGAGGAAACAGCTAGATAGCGCTGTGGAGTCGTGGACCGACACTACCTGGAATATTGATGCGTTGCAGGATACACTCGCATTCTGCATAATGCATCGCCGCGTCATTGATATTCAAACTTTAATCTCACCAGCGGGGCCGGGCGGTTTCTAG

Protein sequence:

>DPOGS200990-PA
MAGPPLGTGARSAAAPPPPPPPRPAPPILLAADPVLGERGETALEGEPISCFSVGGERRLCLPQILTSVLTDFTLEQINRVCDELQIYCSRCTPEQLHELKSEGVLPRSAPSCGLITQTDAERLCAALLHAPASAPRRLGGFRVYHECFGGASGVCSPELGLVQCAECRGVFAPRRFVCHSHGTEHRTCHWGFDSSRWRRFVLVSEEERDKDECGALLDELAAREAAAPAHAPPTVVKQHAPLKRKQLDSAVESWTDTTWNIDALQDTLAFCIMHRRVIDIQTLISPAGPGGF-