Monarch geneset OGS2.0

DPOGS209543
TranscriptDPOGS209543-TA1392 bp
ProteinDPOGS209543-PA463 aa
Genomic positionDPSCF300092 - 226971-239205
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0165271e-12388.70% 
BombyxBGIBMGA012413-TA1e-17685.15% 
DrosophilaCG11093-PB3e-11283.26% 
EBI UniRef50UniRef50_E2AYU62e-12161.75%Ladybird homeobox corepressor 1-like protein n=4 Tax=Formicidae RepID=E2AYU6_CAMFO
NCBI RefSeqXP_394237.22e-12261.92%PREDICTED: similar to CG11093-PA isoform 1 [Apis mellifera]
NCBI nr blastpgi|3287789973e-12261.95%PREDICTED: hypothetical protein LOC410761 isoform 1 [Apis mellifera]
NCBI nr blastxgi|3071680931e-12460.05%Ladybird homeobox corepressor 1-like protein [Camponotus floridanus]
Group
Gene OntologyGO:00001661.5e-37nucleotide binding
GO:00056341.6e-37nucleus
GO:00054881.2e-33binding
KEGG pathway 
InterPro domain[29-458] IPR0232164.4e-184Transcription regulator SKI/SnoN
[20-125] IPR0090611.5e-37DNA binding domain, putative
[13-123] IPR0033801.6e-37Transforming protein Ski
[135-228] IPR0109191.2e-33SAND domain-like
[136-228] IPR0148902.6e-29c-SKI Smad4-binding domain
Orthology groupMCL14116 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209543-TA
ATGGAGGCTACAAACCTAGCGATGTCGCCCCGAGAGCCGATCAAGCCTCGAACAGAGCATAAGGCTTTAGAAATTACAGGCGGGAAACCAAATCAAGTCGGTACAGTTCTGCTATACGGCGTACCGATTGTATCCCTAGTTATTGAAGGTACGGAGAGATTGTGTCTCGCTCAGATATCAAACACTCTGTTGAAGCAGTTCTCATACAACGAGATTCATAATAGGAGAGTGGCGCTGGGTATCACCTGTGTCCAATGTACGCCTGTACAGCTGGAGATATTGAGGCGAGCCGGCGCTATGCCCGTCTCGTCTAGACGATGCGGTATGATAACACGTCGTGAGGCAGAGCGTCTTTGCAAGTCTTTTTTGGGAGACAACGCACCACCTCGTCTCCCTGATGACTTCGCGTTTGCTGTCCATCACGAATGCGCTTGGGGCTGTCGCGGAGCATTCCTGCCAGCGAGGTACAATTCATCACGCGCGAAGTGCATCAAGTGCGCTTACTGCGGTCTGTTTTTTTCGCCAAACAAATTCATCTTCCACTCTCACCGCGTCGGTCCCGGTGATAAGTATGTACAGCCTGACGCTGCTAACTTCAACTCCTGGAGACGGCACATGAAGCTGAGCGGCAGTCCGCCCGTGGAGGTTGTACACGCTTGGGAAGACGTTAAGGCTATGTTCAACGGCGGAACTAGGAAACGTATGCTATCAGCCGCTAGCATCGTTTTCGGTCTTAAATTCCGCATCAGACGCGTTGTTCGCAGGGAGGAACCGGAGCCGAAGCGAGCGGCGTCGAGTCCCCCAGCGCCGCCCGTGCCGAGACTAGCGGACTATGTATGGGGAGCTCGCCTACCACTGCCGTACGCTATACCCTGGCTGAAACCAACCTTATGGTCTCCTGGTGCCCTAACAGCCTTGAGCAGTGTAGGTGGTATAGAAGAGCCGCGAGCTTTCCGTCCAGTCCGCTCTCATCGTCTGGAGTCACCACAACTCTCACCAACAGTGTCTCCCTCTATAACACCATCCAGGTCCCCGACTACATCTCCTCGACAATCTCGCTCACCCCTCCGCTCCCCGCTGCGCTCGCCGCTCCGCTCATCACCGGCGCGCTCGTCTAAAAGTGATAGAGACAGTGACGAGAGTGTCGATATTGAGACCACAGAGGAAGATCAAATTAAGGACGCGAGATGTACCTGGCCGTGTCGTGGAATAGCCAGGCCTCGTGAAGAATGGGGAGCGCTGGTTCCAAAGGTGGAGAGAGAGGAGGAACCGTGGCGGCTTCCTGATCTTCGTCCACCGCTGCATTATTTGCATGATCGCGAGAGGGAAAGCTGTGCAGCTTGCGTGGCACCTTTAACGCTGCTACCGCCACCTACCACAGCGCCGCACTAA

Protein sequence:

>DPOGS209543-PA
MEATNLAMSPREPIKPRTEHKALEITGGKPNQVGTVLLYGVPIVSLVIEGTERLCLAQISNTLLKQFSYNEIHNRRVALGITCVQCTPVQLEILRRAGAMPVSSRRCGMITRREAERLCKSFLGDNAPPRLPDDFAFAVHHECAWGCRGAFLPARYNSSRAKCIKCAYCGLFFSPNKFIFHSHRVGPGDKYVQPDAANFNSWRRHMKLSGSPPVEVVHAWEDVKAMFNGGTRKRMLSAASIVFGLKFRIRRVVRREEPEPKRAASSPPAPPVPRLADYVWGARLPLPYAIPWLKPTLWSPGALTALSSVGGIEEPRAFRPVRSHRLESPQLSPTVSPSITPSRSPTTSPRQSRSPLRSPLRSPLRSSPARSSKSDRDSDESVDIETTEEDQIKDARCTWPCRGIARPREEWGALVPKVEREEEPWRLPDLRPPLHYLHDRERESCAACVAPLTLLPPPTTAPH-