Monarch geneset OGS2.0

DPOGS202501
TranscriptDPOGS202501-TA2217 bp
ProteinDPOGS202501-PA738 aa
Genomic positionDPSCF300131 - 444441-485006
RNAseq coverage1152x (Rank: top 11%)
Annotation
HeliconiusHMEL0074300.087.82% 
BombyxBGIBMGA012449-TA5e-17163.57% 
Drosophilagro-PE0.063.02% 
EBI UniRef50UniRef50_Q7PMQ20.070.11%AGAP010324-PA n=17 Tax=Coelomata RepID=Q7PMQ2_ANOGA
NCBI RefSeqNP_001128361.10.093.81%groucho [Bombyx mori]
NCBI nr blastpgi|2010253900.093.81%groucho [Bombyx mori]
NCBI nr blastxgi|2010253900.093.81%groucho [Bombyx mori]
Group
Gene OntologyGO:00055151.8e-70protein binding
GO:00056342.6e-57nucleus
GO:00063552.6e-57regulation of transcription, DNA-dependent
KEGG pathwaytca:6564960.0 
 K04497 (GROUCHO)maps-> Wnt signaling pathway
    Notch signaling pathway
InterPro domain[20-139] IPR0056171.8e-70Groucho/TLE, N-terminal Q-rich domain
[404-738] IPR0110461.3e-69WD40 repeat-like-containing domain
[451-736] IPR0159431.7e-65WD40/YVTN repeat-like-containing domain
[637-659] IPR0091462.6e-57Groucho/transducin-like enhancer
[573-612] IPR0016803.1e-08WD40 repeat
[576-612] IPR0197815.8e-08WD40 repeat, subgroup
Orthology groupMCL10141 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202501-TA
ATGTATCCTAGCGCGGGCGCGATGAACGCCGCCGCTGCCGCCGCTGCCGTGGCGGCCGCCAGGCATCCGGGCCCCCCGCAGCCCGGGCAACCCATCAAGTTCACGGTGGGCGAGTCATGTGACAGGATTAAGGAGGAATTTAATTTCTTACAAGCTCAATATCATAATTTAAAATTAGAATGCGAGAAACTGGCTAGTGAAAAAATTGAAATACAGAGGCATTATGTTATGTACTATGAAATGTCATACGGGCTCAACGTGGAAATGCACAAACAGACGGAGATCGCTAAGAGATTAAATGCTATAATAGCTCAAATATTGCCATTCCTCTCTCAAGAGCATCAGCAGCAAGTGGCGTCGGCAGTGGAGAGGGCGAAGCAGGTCACGATGACAGAACTGAACGCTATTATTGGGGTCGGCTGGCCGCGCGCCAACCGCCGACATCCAACATCCAACATCCAAGAGCGAACGACGAACAACGACCAACGCGATTTTCAACAGCGACCAGACCTGCCGCGTCTCTTGCAGCAGATGCATGCAGCACATTTGCCGGCACACGGAGCTCCGCCACTGCCTCTTCTCAGCCAAGGAGCCCTGCCGCCCGCGGGGTTACTGGGCCTCGGAGTACCCCACCATCCTCTGTCAGTGCTCGCCAAACCCCCCGACATACATCGTCCTGATGATAAGGGCAATGGTATCAGCTCGGCGGAAGAGCGACACAGAAATTCAATATCCCCGGGCGAGAGAGAGAAATATAGAACAAGGAGTCCCGCTGAACCAGATCACAAGAAACTAAAAAAGGAGGAAAAAGATATGGGACATGAATTAGTTGTAGACGACGCCAGCGAAGAACCCACATCACCTCACAACGGGGCGCCTTCACCCAGAGAGAACGGTCTGGACAAACTTCAACCCAAGAAAGAACATCCCCCTCACAGTCCGCGGTCTGGAACGTCCAGTAACGCATCGACGCCTTCGACAAAAAAGTTAGACGAGAAACCCAGCACGCCGATTTCAAAACCGGTGACGCCGACTTCCGGCGCTAGTGGCGTCGGCTCGGCGGGGCCACCTATGAAGGCGGCGGTGAAGCCCCCGGCGTTACAGTACCCCTACCTAGGTAACGGGGCCCACGACGCATACGGACTTGCCGGATATTCAGCCAGAGCGGCGATGGCGTACGAGCCACTACGTCCCCCAATAGGACCAGCGGCTCTGGCACCCATACCTGGCGGAAAACCAGCGTACTCGTTCCACGTATCGGCCGAGGGCCAGATGCAACCGGTCCCATTCCCCCCGGACGCCCTCATGGGGCCGGGGATCCCCCGCCACGCGCGGCAGGTGTCCGCCCTCGCCCACGGGGAAGTGGTGTGCGCGGTGACAGTCTCCTCGCCAACCAAGTACGTGTACACCGGCGGTAAGGGCTGCGTCAAGGTGTGGGACATCAGCCAGCCGAGCAAAGCGCCCGTCAGCCAGCTGGATTGTTTGCAACGTGATAATTACATCCGGTCGGTGAAGTTACTTCCTGACGGCCGGACCTTGATTGTCGGCGGGGAAGCCTCCAACTTGTCTATATGGGACCTCGCTTCTCCGACTCCCCGCATTAAGGCGGAACTGACGTCATCAGCGCCCGCTTGTTACGCGCTGGCTATTAGCCCAGACTCTAAGGTGTGCTTCAGTTGTTGTTCCGACGGCAACATCGCGGTGTGGGACCTCCACAACCAGACCCTGGTGAGACAGTTCCAGGGACACACGGACGGAGCCTCATGCATCGACATCTCCGCTGACGGCACCAAGCTTTGGACGGGCGGACTTGATAATACTGTCAGATCCTGGGATTTAAGAGAAGGAAGACAATTACAACAGCACGACTTCAGCTCACAGATATTCTCACTGGGATACTGTCCGACGGGTGAATGGCTCGCAGTGGGCATGGAGAACAGCAACGTGGAGGTGTTGCACGCCGTGAAGCCTGACAAGTACCAACTGCACCTGCACGAGTCCTGTGTACTTTCCCTCAGGTTCGCCTCCTGCGGGAAGTGGTTCGTCTCCACGGGGAAGGACAACCTGCTCAACGCCTGGCGCACGCCCTACGGGGCGAGCATCTTCCAGTCTAAGGAGTCGTCGTCGGTGCTGAGCTGCGACATCTCATCGGACGACAAGTACATAGTGACCGGGTCAGGCGACAAGAAGGCCACAGTGTACGAAGTGATCTACTAA

Protein sequence:

>DPOGS202501-PA
MYPSAGAMNAAAAAAAVAAARHPGPPQPGQPIKFTVGESCDRIKEEFNFLQAQYHNLKLECEKLASEKIEIQRHYVMYYEMSYGLNVEMHKQTEIAKRLNAIIAQILPFLSQEHQQQVASAVERAKQVTMTELNAIIGVGWPRANRRHPTSNIQERTTNNDQRDFQQRPDLPRLLQQMHAAHLPAHGAPPLPLLSQGALPPAGLLGLGVPHHPLSVLAKPPDIHRPDDKGNGISSAEERHRNSISPGEREKYRTRSPAEPDHKKLKKEEKDMGHELVVDDASEEPTSPHNGAPSPRENGLDKLQPKKEHPPHSPRSGTSSNASTPSTKKLDEKPSTPISKPVTPTSGASGVGSAGPPMKAAVKPPALQYPYLGNGAHDAYGLAGYSARAAMAYEPLRPPIGPAALAPIPGGKPAYSFHVSAEGQMQPVPFPPDALMGPGIPRHARQVSALAHGEVVCAVTVSSPTKYVYTGGKGCVKVWDISQPSKAPVSQLDCLQRDNYIRSVKLLPDGRTLIVGGEASNLSIWDLASPTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISADGTKLWTGGLDNTVRSWDLREGRQLQQHDFSSQIFSLGYCPTGEWLAVGMENSNVEVLHAVKPDKYQLHLHESCVLSLRFASCGKWFVSTGKDNLLNAWRTPYGASIFQSKESSSVLSCDISSDDKYIVTGSGDKKATVYEVIY-