Monarch geneset OGS2.0

DPOGS204225
Transcript: DPOGS204225-TA (1518 bp)
Protein: DPOGS204225-PA (505 aa)
Genomic position: DPSCF300046 - 695207-697410
RNAseq coverage: 502x (Rank: top 25%)
Annotation
Heliconius       HMEL015146          0.0   97.23%
Bombyx           BGIBMGA007583-TA    0.0   90.34%
Drosophila       ebi-PA              0.0   87.43%
EBI UniRef50     UniRef50_O60907     0.0   70.13%   F-box-like/WD repeat-containing protein TBL1X n=20 Tax=Eukaryota RepID=TBL1X_HUMAN
NCBI RefSeq      XP_317781.4         0.0   87.60%   AGAP007739-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|158297568        0.0   87.60%   AGAP007739-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|158297568        0.0   87.80%   AGAP007739-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0005515            1.1e-75   protein binding
KEGG pathway     aga:AgaP_AGAP007739   0.0
                 K04508 (TBL1) maps-> Wnt signaling pathway
InterPro domain  [148-504]   IPR011046   1.1e-75   WD40 repeat-like-containing domain
                 [340-504]   IPR015943   2.8e-50   WD40/YVTN repeat-like-containing domain
                 [422-460]   IPR019781   1.4e-13   WD40 repeat, subgroup
                 [421-460]   IPR001680   6.7e-13   WD40 repeat
                 [6-32]      IPR013720   1.2e-09   LisH dimerisation motif, subgroup
                 [174-188]   IPR020472   1.3e-07   G-protein beta WD-40 repeat
                 [4-36]      IPR006594   9e-06     LisH dimerisation motif
Orthology group  MCL10939   Multiple-copy universal gene

Nucleotide sequence:

>DPOGS204225-TA
ATGAGTTTTTCCATTGATGAAGTGAATTTTCTCGTGTATCGATACTTACAGGAGTCCGGATTCCATCATTCAGCTTATACATTTGGAATTGAATCACATATATCCCAAAGCAATATAAATGGAGCCTTAGTTCCGCCGGCAGCATTGTTAAATATTCTGCAGAAGGGTTTGCAATATACCGAAGCCGAAATAACTATAGGAGAAGATGGAACTGAAACACGTCTTACAGAGAGCCTGAGTTTAATTGATGCTGTAACTCCGGATATTGTTTCCACTCGTCAAAATGCTCATAATGCTCAAAAGCAAGCTAATAAAGAACCAGGGTCTGGTGGAGAACAAAACGGAGTTGATGGAACAGCATGTAGTGCAGCATCAACTACAGGAGGTACAGTAACACCAAATGTACCAGAAAATATGGATGTTGATCAATCAATCGAAATACCAGCAAGCAAGGCGACAGTACTACGAGGACATGAATCTGAAGTGTTTATTTGCGCTTGGAATCCAAGCACTGATTTATTGGCTAGTGGCTCAGGAGATAGTACTGCTCGAATATGGGACATGTCAGACAATCCAGCAACTACCCCTAATCAGCTTATATTAAGACATTGTATTCAAAAAGGTGGAGCTGAGGTGCCCAGCAATAAAGATGTTACCTCATTGGATTGGAATTGTGATGGTAACTTATTGGCAACTGGATCATATGATGGCTATGCAAGAATCTGGACAACTGATGGCACATTAGCATCTACCTTGGGACAGCACAAAGGTCCTATATTTGCACTTAAGTGGAATAAGAGGGGAAATTATATCTTAAGTGCAGGGGTTGACAAGACAACAATTATATGGGATGCAGCATCAGGCCAATGCACTCAACAGTTTTCTTTCCATGCAGCACCAGCTCTTGATGTTGATTGGCAAACAAACAACTCATTTGCTTCATGTTCAACTGACCAATGTATTCATGTTTGCAGATTACATGTTGACAAACCAATAAAAAGTTTCAAGGGACATACGAATGAAGTCAATGCAATAAAATGGGACCCACAAGGACAACTCCTTGCATCATGTTCAGATGACATGACATTAAAAATATGGTCCATGAAACAAGACACATGGGTTCATGACCTGAAGGCACATTTGAAAGAAATATACACCATAAAGTGGTCTCCTACTGGTCCTGGAACACAAAATCCTAATATGAATTTGATCTTAGCCAGTGCATCATTTGATTCTACGGTGCGCTTATGGGACGTGGAAAGAGGAGTTTGTATTCATACTCTAACTAAACATACTGAACCAGTTTACAGTGTAGCATTCTCTCCCGACGGAAAATTTTTAGCCAGTGGCTCCTTTGACAAGTGTGTTCACATTTGGTCTACGCAGACAGGTGGGCTGGTACATTCTTATAAAGGGACGGGTGGCATTTTTGAAGTATGCTGGAATTCAAGAGGTACAAAAGTAGGTGCCAGTGCGAGTGATGGAAGTGTTTTTGTCCTAGATTTACGCAAATTGTAA

Protein sequence:

>DPOGS204225-PA
MSFSIDEVNFLVYRYLQESGFHHSAYTFGIESHISQSNINGALVPPAALLNILQKGLQYTEAEITIGEDGTETRLTESLSLIDAVTPDIVSTRQNAHNAQKQANKEPGSGGEQNGVDGTACSAASTTGGTVTPNVPENMDVDQSIEIPASKATVLRGHESEVFICAWNPSTDLLASGSGDSTARIWDMSDNPATTPNQLILRHCIQKGGAEVPSNKDVTSLDWNCDGNLLATGSYDGYARIWTTDGTLASTLGQHKGPIFALKWNKRGNYILSAGVDKTTIIWDAASGQCTQQFSFHAAPALDVDWQTNNSFASCSTDQCIHVCRLHVDKPIKSFKGHTNEVNAIKWDPQGQLLASCSDDMTLKIWSMKQDTWVHDLKAHLKEIYTIKWSPTGPGTQNPNMNLILASASFDSTVRLWDVERGVCIHTLTKHTEPVYSVAFSPDGKFLASGSFDKCVHIWSTQTGGLVHSYKGTGGIFEVCWNSRGTKVGASASDGSVFVLDLRKL-
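
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code and the record's convention of writing the stop codon as `-`. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code in TCAG order; stop codons are written as "-"
# to match the trailing "-" in the protein record above.
BASES = "TCAG"
AMINO = "FFLLSSSSYY--CC-WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon (frame 0, no ambiguity codes)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Length check: 505 residues plus one stop codon should account for 1518 bp.
assert 3 * (505 + 1) == 1518

# First 24 nt and last 15 nt of DPOGS204225-TA, copied from the record above.
print(translate("ATGAGTTTTTCCATTGATGAAGTG"))  # MSFSIDEV
print(translate("TTACGCAAATTGTAA"))           # LRKL-
```

Passing the full DPOGS204225-TA sequence to `translate` and comparing the result with DPOGS204225-PA would extend the same check to the whole record.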