Monarch geneset OGS2.0

DPOGS204512
TranscriptDPOGS204512-TA3582 bp
ProteinDPOGS204512-PA1193 aa
Genomic positionDPSCF300205 - 108124-119838
RNAseq coverage577x (Rank: top 22%)
Annotation
HeliconiusHMEL0089790.078.39% 
BombyxBGIBMGA012514-TA0.072.06% 
DrosophilaCG7261-PA0.040.43% 
EBI UniRef50UniRef50_D6X4J70.045.08%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X4J7_TRICA
NCBI RefSeqXP_969240.10.045.08%PREDICTED: similar to beta-tubulin cofactor D [Tribolium castaneum]
NCBI nr blastpgi|910918720.045.08%PREDICTED: similar to beta-tubulin cofactor D [Tribolium castaneum]
NCBI nr blastxgi|910918720.045.08%PREDICTED: similar to beta-tubulin cofactor D [Tribolium castaneum]
Group
Gene OntologyGO:00054881.9e-41binding
KEGG pathwaybmy:Bm1_493954e-122 
 K12864 (CTNNBL1)maps-> Spliceosome
InterPro domain[317-1055] IPR0160241.9e-41Armadillo-type fold
[893-1087] IPR0225773.3e-41Tubulin-specific chaperone D, C-terminal
[663-678] IPR0119895e-19Armadillo-like helical
Orthology groupMCL12083 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204512-TA
ATGGTGGCCCTCGATGGAGACATAAATAAGGACGACGACTGTGAAAATATTGGACTTGGATGCGCTCTAGAACACTTTTCCGAAGTCGAAGATGTACTGAATATGATAGAAAATGTGAAAAATATTTATAACACACCAACATTCGAAGTCGAATATGACAAATTGTACACAATTTTGAAACAGTATTATGAACAACCGCATTTACTTGATCCACATTTGGATAAACTACTCGCGAAATTTATGTCTATTATTAAGGATAAGGAAAGCCCGATTGAGTTAAAAAATGCAACATTTAATTACATGTATCAAATTATTCGAGTGAGGGGCTACAAGGTGGTTGTGAGACATCTCCCCCATGAGGTGTCAGATCTTCTAACAGTGCTTTCCTCTCTAGAAGCCCAGGATCCAAATGATAAAGAAACCTGGAGAAGCAGATTTGTCTTGTTGCTCTGGCTTTCCATTGTAGTGATAATACCATTCCATATGAGTAGACTGGATGGATTTGCCCCTAATGCATCAGGGGCTGGTTCTTCGAAGAAGTTAACAGTTATGGAGAGAATACTCAATATTTGTAAGACTTATGCATTAAGTAAAGACAGCTGTGCTGAAGCCAGTGCTTATTTGGCATCAAAGTTTTTAATAAGATCTGATGTTAAAGAGTTGTACATGAGCCAGTTCTTTGACTGGGCCTGTGAGTTACATTCCAATATACAAGAGGAAGAAACCATTCATTATGGAGTCCTAGCCGCTGTTGCAGCTGTGTTAAAACATGGCAAACGAGACGACCTCCTGCCTTTCACCCCTAAGTTACTGGAATGGGTTACCACTCAGAACTACCAGCAACATAAGGCTATGCTGGTGCGGAAGTATGGTGTCAAGATTGTGCAGAGAATCGGTCTAACATTCCTCCGCCCGCGCGTGGCGTCCTGGCGTTACACCCGCGGCGCTCGCTCGCTGGCCGTCACACTTGGAGCTGCAGCCGCCGGAGACAACGAACCGATGACTGTAGACCCCGATGATGACGACCAAGACATACCGCAGGAGGTGGAGGATGTAGTGGAACTGCTCCTTCGTTCTCTTCGCGATGAAGACACAGTCGTGCGTTGGTCGGCCGCGAAGGGAGTGGGTCGTATCGGAGCGCGCCTACCCGCCATGGCCGCCGCCGACGTGTGTGACAGTGTACTGACGCTGTTCGCTGACAACGAACGCGACACCGCCTGGCACGGGGGCTGTATGGCTCTGGCTGAACTAGGTCGTCGTGGTCTCATCTCTCCGCGCCAGCTGTCGTCCACGGTCCGTTGCTGTTCAGCGGCCCTGGCTCGTGACGAGCCCCGCGCGTCAGGCGGAGGTGGCGGCGGCGGTGGTAGGGCTGCCAGAGACGCGGCCTGCCATGCTTCCTGGGCCATTGCTAGAGCCTACGACGCGACAGCCCTGACACCGCACGCTACAGTGCTAGCGAACGCGTTGATAGCCACCGCCTGTTTCGACAGAGAGATAAACTGCCGCCGCGCCGCGTCCGCCGCTTACCAGGAGAACGTCGGTCGCCACGGGTTGTTCCCCCACGGGATCGACGTGCTGACCGCCGCCGACTTCCAGTCCGTGGGTCCGCGGAGCCACGCCTACCTCGTGGTGGCGCCGTACGTGGCTCGCTACGCCGAGTACACGCGCCCGCTGGTGGACCATCTCGTCGATTTGAAATTGGAACACTGGGACTGCGCCATCAGAGAACTGGCCGCGAAGGCGCTCAGCGAACTCACTAAACAGACACCAGATTACGTGGCGAAGGAGGTTTTACCGAAATTAGTGAAGAAGACGGAATCTATTGACCTCAACGTACGCCATGGAGCCATACTGGGTATCGGTGAGGCGATATACGCTCTCTCTCAGACGGAACTACCCGACGGCGCGAAGGCATCCGTGTTGATAACAGCTGATGTGTGGCGTGGTGTGCTGGGCGTGCTGGAAGCGCTCCGTGGCCGGCAACAGCTCCGCGGCCTGGGCGGGGAGCTCATGAGGCAGGCGGCGTGCAACGCGCTCGGCAGACTAGCCACTGCCGCCGCGCCCATACACTCAGCCGACACGCTCGATGAGTGGCTGAATCTGATCGAGGAGTGTCTGTCCCATGAAGTACAGACGATTAGAGAAAAGGCCATCGACGCTTTGCCGAAAATATTCGAAGAATATCTCAAGGACGATAAAGTCCAATATTCTGAAATCAACGCCAAGGAGAAGAGAATGCAACTGGTGCAAAAGTACTGCGAACATTTGAACAGCTCCGGGGTCAATGGCCTCTTTCTCAGGATGGGATATTCTCGAGCTCTTGGTTCGCTGCCGAAGTTCGTTTTGTTGGAGAGCATGCAATTAGTCATCGAGTCACTGATACAGTGTACTAAAGTGACAGAGGCAACCATGAAGTGGGCGGAGGCTCGCCGGGACGCGGTGCTAGGGTTGACGGACGTGTGTCAAACTGTAGGGCTCCAAGGAGAGATGGAGCGGTATGTGGAGGACGTTAGGACAGCCCTACTGGACTGTCTGGCGGAGTACACTGTGGACATGAGAGGAGATATAGGAGCCTGGGTCAGGGAAGCCAGTATGACCGGTCTTGTGTCGCTGTGTAGTCAGTGTTCGTCCCAGGCGCCTCACTTGAACACGCCCTCGGCGGTCGGAGACACGGCTCGCGGTCTGGCTCAGCAGGCCGTGGAGAAAATAGACAGGACGAGAGCACACGCCGGGAGGCTCTTCACCGCACTCATCTACAACGATCCACCGATAAGCAACATACCGCAACACGAAGCACTGAAGCGGATATTCCCCAACGAGAAGGACAGGAAGTGCGATGATGTGGAGGGCAGAGACAGCACCAGCGAGGGCGGGGTCATACTGTGGCTGTTCCCCGGACACACTATGCCGAGATTCGTTCAGCTGCTGCAGTACCCCGAGTACAGGTATCACGTGATCAGGGGATTGGTGGTCAGCGCCGGGGAACTCACTGAGAGCCTGCACACGACGCAGGCGTTGTTCTCGTACCTCAACTCGCTGCACTCGTCCCCGCAAACGCTGGCCACCATCTGTGACACCATAGTGAAGGTGTTCGCTGATAATATCCACGTGAAGCGCATCACCGGACCCATGTTCAACTTCCTCGACAGACTGCTCAGCTCCGGATCAATATCTCCTATATTAGATGATCCCGAGTCCACATTCGCTAAGGATATATTGAAATATTTGAAATTGGAACTGAGAGGTGGAAAGAACCTCTACAAGTTGCTTGATTCCATCAACGTATTGTGTCAGCTGCTGCAGGTCGGCGGTGTGGTTTGGTCCAATGCGTTGGGCCAACTCGTGGTGTATCTTTGCTACGGTGAGGGATACGTCCGTCGTTGTGCTGCTGCGAGGCTGTACGAGGCGTTGTCTCTATACGGTGACGTCAGCGAAGTGCCGCCGCCTGCCCTCGAACAGGTGATGACAATACTCGCAGAAACTGATTGGGAAAAAGACGTGACTGTATTAAGACCAATCAGAAACGAGATATGCGACCTCATGAACATCAAACGACCAGTCATGAAGCAGAGACAGACGTAA

Protein sequence:

>DPOGS204512-PA
MVALDGDINKDDDCENIGLGCALEHFSEVEDVLNMIENVKNIYNTPTFEVEYDKLYTILKQYYEQPHLLDPHLDKLLAKFMSIIKDKESPIELKNATFNYMYQIIRVRGYKVVVRHLPHEVSDLLTVLSSLEAQDPNDKETWRSRFVLLLWLSIVVIIPFHMSRLDGFAPNASGAGSSKKLTVMERILNICKTYALSKDSCAEASAYLASKFLIRSDVKELYMSQFFDWACELHSNIQEEETIHYGVLAAVAAVLKHGKRDDLLPFTPKLLEWVTTQNYQQHKAMLVRKYGVKIVQRIGLTFLRPRVASWRYTRGARSLAVTLGAAAAGDNEPMTVDPDDDDQDIPQEVEDVVELLLRSLRDEDTVVRWSAAKGVGRIGARLPAMAAADVCDSVLTLFADNERDTAWHGGCMALAELGRRGLISPRQLSSTVRCCSAALARDEPRASGGGGGGGGRAARDAACHASWAIARAYDATALTPHATVLANALIATACFDREINCRRAASAAYQENVGRHGLFPHGIDVLTAADFQSVGPRSHAYLVVAPYVARYAEYTRPLVDHLVDLKLEHWDCAIRELAAKALSELTKQTPDYVAKEVLPKLVKKTESIDLNVRHGAILGIGEAIYALSQTELPDGAKASVLITADVWRGVLGVLEALRGRQQLRGLGGELMRQAACNALGRLATAAAPIHSADTLDEWLNLIEECLSHEVQTIREKAIDALPKIFEEYLKDDKVQYSEINAKEKRMQLVQKYCEHLNSSGVNGLFLRMGYSRALGSLPKFVLLESMQLVIESLIQCTKVTEATMKWAEARRDAVLGLTDVCQTVGLQGEMERYVEDVRTALLDCLAEYTVDMRGDIGAWVREASMTGLVSLCSQCSSQAPHLNTPSAVGDTARGLAQQAVEKIDRTRAHAGRLFTALIYNDPPISNIPQHEALKRIFPNEKDRKCDDVEGRDSTSEGGVILWLFPGHTMPRFVQLLQYPEYRYHVIRGLVVSAGELTESLHTTQALFSYLNSLHSSPQTLATICDTIVKVFADNIHVKRITGPMFNFLDRLLSSGSISPILDDPESTFAKDILKYLKLELRGGKNLYKLLDSINVLCQLLQVGGVVWSNALGQLVVYLCYGEGYVRRCAAARLYEALSLYGDVSEVPPPALEQVMTILAETDWEKDVTVLRPIRNEICDLMNIKRPVMKQRQT-