Monarch geneset OGS2.0

DPOGS206371
TranscriptDPOGS206371-TA2616 bp
ProteinDPOGS206371-PA871 aa
Genomic positionDPSCF300192 - 245636-250056
RNAseq coverage126x (Rank: top 57%)
Annotation
HeliconiusHMEL0090210.077.24% 
BombyxBGIBMGA005770-TA0.078.99% 
DrosophilaSfmbt-PC3e-3125.42% 
EBI UniRef50UniRef50_F4X1556e-13133.22%Scm-like with four MBT domains protein 1 n=5 Tax=Myrmicinae RepID=F4X155_ACREC
NCBI RefSeqXP_002730907.11e-9232.93%PREDICTED: Scm-like with four mbt domains 1-like, partial [Saccoglossus kowalevskii]
NCBI nr blastpgi|3800258391e-13935.57%PREDICTED: scm-like with four MBT domains protein 2-like [Apis florea]
NCBI nr blastxgi|3800258393e-14535.31%PREDICTED: scm-like with four MBT domains protein 2-like [Apis florea]
Group
Gene OntologyGO:00056345.5e-26nucleus
GO:00063555.5e-26regulation of transcription, DNA-dependent
GO:00055156.2e-13protein binding
KEGG pathway 
InterPro domain[3-99] IPR0040925.5e-26Mbt repeat
[462-574] IPR0219877e-25Protein of unknown function DUF3588
[786-858] IPR0137612e-14Sterile alpha motif-type
[789-858] IPR0109936.2e-13Sterile alpha motif homology
Orthology groupMCL17190 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206371-TA
ATGGAATTTGAATGGAATAATTATTTAGAAGATACAAAAACTATTGCTGTTCCCGAAGAATTATTTTATCACGTTGAAGCCAGCCTCAACAATGGTATCAAGCAGGGGATGTTACTCGAGGTGTGTCACAAGAACAATCCTGATGTATACTGGTTGGCTGAGATCACAATGGTCTGCGGCCACTTGCTTAGGATAAAATTCATTGGTGCCCAGACCGACTTTTGGTGTGATATATCCAGTACTAAAGTCCACCCTCTCGGCTGGTGTGGAAAATATGATGAATTGGTTGAGCCTCCCGATGAGATAAACGAAAGATGCGGAGAAACTATCATAGATATAATGAAAAAAGCCCTCCTTGTTGGACAATCGGTTTCACTCGAGGCATTGAATAACAAAGGGATGTCTCCAATCGATCGAATCAAAGTTGGGATGAAAGTGGAAATACAGAATATAATTGATCCATACAGATACTGGATTGCAACTGTGTGTGAAAACATTGGAGGTCGGCTCTTGTTGAGGTATGATGGAGCTGATGAAGATTTACCACAGTTTTGGATGTTCTTCTGCAACACCAGACTCAGCAGCTTTGGATTTGTCACCAACAAGGGTTCTCCGTGGCAGTTCAAGTACCCAGGCAAAGTTAATAAATTCTCGTGTAAGAACAAACTCAGCACGCAACTGAGACAAAGCGCCGAGGAATCTATCAAAGAACCAACTCCAGCTGATCTGTTTCAGCCAAATCCGATCTTAGAGGCGCACAGTTTTGCTACAGGCATGAAAGTAGAAGCACTAAGTCCGAACGACATGAAGACTTTCCGCCCTGCCACGGTAACCAAAATCTTCAATAATCTCCATTTCTTGGTCGTCATAGACGATCACCTAGAAGATTACGAGGACACCAAAATGGCCTGGCTTTGTGATAACATGCACCCCTACATTTACCCCATCGGCTGGGCACAATCACACAAACTTGATATTAAGCCACCTAAAGTGTGGAAGGAAGGTGTATTTGAATGGGAGGATTACCTTGCGATGACCGCCTCCGTCCCCGCGCCGGAATACTGTTTCGGAAACAAGGAACAGCTTAAAGGAATCGAAGCCAATATGAAGTTGGAAGCGGTGAATCCTCTGAACCACGAGGAAATCCACGTAGCTTCGGTCGAATTAATAGTGGAACACATGTTGTACGTCGAACTTTTGCCGATCGGCGAAAAGTTCTGGTACTCCCAAGATAGCGATCTCTTGTTCCCCGTAGGATGGTGTGACAGCAACAACTATGAGCTCCATATACCAGACACCAACCCAAAAGAAATACTCAAGCCCGTCGAGGAGCCCAAAACGATCAAAGATGACATCAAATCATCGGAAGAGTGGTGCGATAGAATATTTTTCAATTATAAATGTTATGCCGGTCCGTCGATAAGTCGTAACAAATTATCACAGCTGCCCAAAGCTGTGGGCCCTGGACCCTTGCTGCTTGTACTAAAAGAAGTACTCAACAAAATCATCTCGGCCTCATACAAACCGGCGAAATTGCTCAAAGATTGGGAAACTGAAGGTCCGCCAGACGAAGGCATGAAACTAGAAATGCTAAGAGCCAAATTAAAAGCGAGCACGTACCACGCGTTCGTCCCCATAACGACCGAGGCGTCCAAGGTGGGCTCGTTCTGTCGGTCCATATGTGTCAAGCTACAGGCCTGTCCCAGTCTGTTCGGACCTGACGAGTACCCTCTGCAATGTCCGCACGCCTGTCAGACGGTCGAGAAGTCAACCTTCCATAATGGAACAGAACGACGAGGCAGGCCGAAAGGTAGCGTGAATGGAAGGAAAAAAAAGAAAAAAACACAGCAGGAGAAGAGGGAAAAGGAGCAACCGCCAGTACAGGAGATTGAGCACAGAGATATAGAGTCGGTTGAAAGCGAACACAGCGCCGGGTCCACACCGCCCTCGGAGTCGGGCACTAGGCCGAACACACCAGAGAGCGTCACAGACTACAAGAGGAATACCAGGAGAAAACGTGACCCCAAGAACAGTTACCCCAAATTAGAAATGAAGACACGTGGAGCTAAGTTACCAAATTTCGCGCTTCAGATGAAAGAAGCGCACTGGGACAAGAAGGACGTGGAGACGATTTACAGCAACTCGTGCGTCAGTAAGAAAAGAAATACACAGGACAGCGACACAGAGAACGAGACCAACGAGAGTAGCTGCAATTCGAGAGACTCCAAAAACATATCGGACGTCAGCGACTCGGAGGAACCGGAACTGAAGAAACTCAAGTTCAACACCAACGATCCGCTGCCGTCAGATAACAAGATGTTCGAGAAGAACACAACCACCGCCTGGGCCAGGGGGAAAATGAAGCTGGCGAGGAACCCCCTGGACTGGACGGTCGACGACGTCTACAACTATCTGAGCAACACCGACGACTGCAAACTGATAGCGGACAAGATGAAGCAGGAAGAAATAGACGGCCAGGCCTTCATCATGCTGGACCTACCCATCATAACGCACTTCCTGCACATGAAGAAGGAGTTCGCTATGCAGCTCTGCAAACACATCACCATGATACGGTGGTACTACATCGACAACTTCGACGACAACGCCGACATGTAA

Protein sequence:

>DPOGS206371-PA
MEFEWNNYLEDTKTIAVPEELFYHVEASLNNGIKQGMLLEVCHKNNPDVYWLAEITMVCGHLLRIKFIGAQTDFWCDISSTKVHPLGWCGKYDELVEPPDEINERCGETIIDIMKKALLVGQSVSLEALNNKGMSPIDRIKVGMKVEIQNIIDPYRYWIATVCENIGGRLLLRYDGADEDLPQFWMFFCNTRLSSFGFVTNKGSPWQFKYPGKVNKFSCKNKLSTQLRQSAEESIKEPTPADLFQPNPILEAHSFATGMKVEALSPNDMKTFRPATVTKIFNNLHFLVVIDDHLEDYEDTKMAWLCDNMHPYIYPIGWAQSHKLDIKPPKVWKEGVFEWEDYLAMTASVPAPEYCFGNKEQLKGIEANMKLEAVNPLNHEEIHVASVELIVEHMLYVELLPIGEKFWYSQDSDLLFPVGWCDSNNYELHIPDTNPKEILKPVEEPKTIKDDIKSSEEWCDRIFFNYKCYAGPSISRNKLSQLPKAVGPGPLLLVLKEVLNKIISASYKPAKLLKDWETEGPPDEGMKLEMLRAKLKASTYHAFVPITTEASKVGSFCRSICVKLQACPSLFGPDEYPLQCPHACQTVEKSTFHNGTERRGRPKGSVNGRKKKKKTQQEKREKEQPPVQEIEHRDIESVESEHSAGSTPPSESGTRPNTPESVTDYKRNTRRKRDPKNSYPKLEMKTRGAKLPNFALQMKEAHWDKKDVETIYSNSCVSKKRNTQDSDTENETNESSCNSRDSKNISDVSDSEEPELKKLKFNTNDPLPSDNKMFEKNTTTAWARGKMKLARNPLDWTVDDVYNYLSNTDDCKLIADKMKQEEIDGQAFIMLDLPIITHFLHMKKEFAMQLCKHITMIRWYYIDNFDDNADM-