Monarch geneset OGS2.0

DPOGS204228
TranscriptDPOGS204228-TA1665 bp
ProteinDPOGS204228-PA554 aa
Genomic positionDPSCF300046 - 656652-660626
RNAseq coverage271x (Rank: top 40%)
Annotation
HeliconiusHMEL0151530.090.48% 
BombyxBGIBMGA007509-TA0.085.82% 
DrosophilaCG11964-PA0.057.95% 
EBI UniRef50UniRef50_E2C2K60.062.95%Beta-catenin-like protein 1 n=9 Tax=Coelomata RepID=E2C2K6_HARSA
NCBI RefSeqXP_967379.10.064.53%PREDICTED: similar to CG11964 CG11964-PA [Tribolium castaneum]
NCBI nr blastpgi|3407196520.064.72%PREDICTED: LOW QUALITY PROTEIN: beta-catenin-like protein 1-like [Bombus terrestris]
NCBI nr blastxgi|3407196520.064.60%PREDICTED: LOW QUALITY PROTEIN: beta-catenin-like protein 1-like [Bombus terrestris]
Group
Gene OntologyGO:00054886e-18binding
KEGG pathwaytca:6557270.0 
 K12864 (CTNNBL1)maps-> Spliceosome
InterPro domain[62-168] IPR0131806.4e-38Domain of unknown function DUF1716, eukaryotic
[118-445] IPR0160246e-18Armadillo-type fold
[134-393] IPR0119897.4e-10Armadillo-like helical
Orthology groupMCL13731 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204228-TA
ATGGATGTTGGGGAATTACTTTCGTTCAAACCGGTACCAACACCTAAACGCCCTAATGAGGAAGATTTGGAGGATTCAGAGGATGAAGGCAAGGGATCAAAGAAAGCAAAGCGATCAAATAAATCCGCAATTAATCGAATGCAGCAATTAGGCAATAAAACCCTACCAAAAGAACCAATAATTACTGATAAGGAAAAAGAAGATATTTTAAGATATGTGGAAACAGAGGCTACAGAGGGAGAAATACTTGATGATACTGCTGTGAAAAAGCTTGTCCTTAATTTTGAGAAGAAAGCCCTCAGAAATAGAGAAATGAGGATTAAGTTTCCTGATCAACCAGAAAAGTTTATGGAGAGTGAAATAGATTTATTTGAAGCTCTTCAGGATTTAAGTGCAGTGGCAACAGTACCAGATCAGTATCCACTATTAGTTGAATTAAAATGCATTAATTCCATACTGGAGTTACTCTCGCATGACAATACTGATGTTTCAACAAAAGTTGTGAATTTGCTTCAGGAACTCACTGATGTAGATATTTTGCATGAGAGTGAAGAAGGTGCAGAGGAGTTGATCAATGCTCTAGCAGAAGCTGAGTGTCCGTCTTTGTTGTTACACAATTTGGCCCGTCTAGATGAACAGGTGCCTGACGAGAGAGATGCTGTTCACAATACTTTAGGTATTATAGAGAACATAACAGAGTTTAGGCCGGAAATGTGTGTGGAGGTTGCAAAACAAGGTTTTATACAGTGGATATTAAAAAGACTGAAGTTAAAAGTACCGTTTGATGGCAACAAATTGTATGCAACCGAAATTTTATCAATTCTGTTACAAAACACACCAGAAAATAGAAAACTGCTTGGTGAATTAGATGGTATCGACGTTCTCAATGTTTTCTACAAGCGTCATGACCCCAGTAGTGCAGAGGAGCAGGAGGCCATGGAGAACATGTTTGACTCGCTGTGCTGTGCGCTCATGGAGCCGCTCAATAGAGACCGCTTCCTTAGAGGCGAAGGCCTGCAGCTTATGAATCTTATGCTGAGAGAGAAGAAAATGTCCCGGAATGGTTCTTTGAAAGTATTAGACCACGCTTTGGCCGGTCCTGATGGTAAGGACAATTGTAACAAATTCGTGGACATCCTCGGCCTGCGGACGGTATTCCCTTTATTTATGAAGACACCGAAGAGGAAGAGAATATTGACCTTAGATCAGCATGAGGAGCACGTCGTATCAATAATATCGTCGATGTTACGCAACTGTCTGGGAAGTCAACGTCAGAGGCTCTTGGCCAAATTCACTGAAAACGACTTGGAGAAGGTAGACAGACTGTTGGAACTGCACTTTAAATATATGGACAAAGTGGATCGTACTGAGAAAGAGATGGAGATGGAAGCCGAAGATTTGGATGACGATGCTCAATACTTGAAGAGATTATCCGGCGGGTTGTTTACTCTACAGCTGATAGATAGAATTATTTTAGAGGTGTGCACCGCTGGTCCGCCGGCCATCAAGCAGAGAGTACAGCGCGTGCTCTCCCTGCGGGGAGGGTCGCTTAAAATTATCAGACACGTTATGAGAGAATACGCCGGCAACCTCGGCGACGCCGGCAGCGAGGACTGGCGCCACCAGGAACAACAACACATACTGCAGCTCGTCGACAAGTTCTAG

Protein sequence:

>DPOGS204228-PA
MDVGELLSFKPVPTPKRPNEEDLEDSEDEGKGSKKAKRSNKSAINRMQQLGNKTLPKEPIITDKEKEDILRYVETEATEGEILDDTAVKKLVLNFEKKALRNREMRIKFPDQPEKFMESEIDLFEALQDLSAVATVPDQYPLLVELKCINSILELLSHDNTDVSTKVVNLLQELTDVDILHESEEGAEELINALAEAECPSLLLHNLARLDEQVPDERDAVHNTLGIIENITEFRPEMCVEVAKQGFIQWILKRLKLKVPFDGNKLYATEILSILLQNTPENRKLLGELDGIDVLNVFYKRHDPSSAEEQEAMENMFDSLCCALMEPLNRDRFLRGEGLQLMNLMLREKKMSRNGSLKVLDHALAGPDGKDNCNKFVDILGLRTVFPLFMKTPKRKRILTLDQHEEHVVSIISSMLRNCLGSQRQRLLAKFTENDLEKVDRLLELHFKYMDKVDRTEKEMEMEAEDLDDDAQYLKRLSGGLFTLQLIDRIILEVCTAGPPAIKQRVQRVLSLRGGSLKIIRHVMREYAGNLGDAGSEDWRHQEQQHILQLVDKF-