Monarch geneset OGS2.0

DPOGS204038
Transcript | DPOGS204038-TA | 2127 bp
Protein | DPOGS204038-PA | 708 aa
Genomic position | DPSCF300138 + 99373-104538
RNAseq coverage | 121x (Rank: top 57%)
Annotation
Heliconius | HMEL004958 | 6e-64 | 50.18%
Bombyx | BGIBMGA004788-TA | 1e-120 | 58.42%
Drosophila | tacc-PK | 5e-28 | 37.25%
EBI UniRef50 | UniRef50_D6X3W9 | 7e-59 | 51.50% | Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X3W9_TRICA
NCBI RefSeq | XP_967717.1 | 1e-59 | 51.50% | PREDICTED: similar to LOC565922 protein [Tribolium castaneum]
NCBI nr blastp | gi|91091574 | 3e-58 | 51.50% | PREDICTED: similar to LOC565922 protein [Tribolium castaneum]
NCBI nr blastx | gi|355695657 | 1e-58 | 40.62% | intraflagellar transport 46-like protein [Mustela putorius furo]
Group
KEGG pathway 
InterPro domain | [473-697] | IPR022088 | 1.9e-68 | Intraflagellar transport complex B protein 46 C-terminal
InterPro domain | [57-264] | IPR007707 | 3.6e-30 | Transforming acidic coiled-coil
Orthology group | MCL13286 | Single-copy universal gene

Nucleotide sequence:

>DPOGS204038-TA
ATGGCTGTTATTGACAGACTCTTATCTCTAAGCGGAAACTCTTCGCGATTAGAAGAAGCTAGTTCTGTACCTCTGCCCAGACACTACAGTGAGACAGATTTTGCCCTGACCCAACTCAGGGAACTATTGGCAGAGAAAGAAATAAACGTACATAATTTAAGGTTAGAGTCCAAGGAATTAAAAGAAAGATTAGCAAACATGGAAACTCAAGTGCACAGTTTAGAAAATGAGAGCAAAGAAAGATTGAGGAAAGTGAACGAGCTCAACGAAAGACTAGCGGAAAAGAATACGATTAACAAGAGTCTGGCTGTAGTAGTGGAAGAATATGAACGGACCATCGCCAGTTTGATAGCAGGAATGGAACAGGATAGAAAGAGAAATGCAGAAGAGAGAAAGAGAATCATCAGCGAGAGAGACGAACAGACAGCTCACCTCGCGAGCATGGAAGTGTCCTTCAGGGATCTACACAGTAAGTACGAAAAGAGTAAGCAGATAATATTAAATATGAAGGCCAACGAGGACAAGTACAAGGCATCTCTGAAGACATTCGAAGACAATCTGTTGAAAATGCAGAACAACTATGAGTTACTAAAACAGCACGCGACATCGAAACTCAACCACGCCAATCAGGAGTTGGAGAAGTATAACAGGAGCCACGAGGCGGAGGTCTTGAAGCTGAACGCTATGATTAAGAGGAAGGAATTACATATAACGTCACTGGAAGAATCGCTGTCTCAGAAGACCAAGGCCAACGAGGAATTGACGGCCATTTGCGATGAGCTCATTAATAAAATAATTGTGTTATCAAACTGTGACACACTCGCCACACATCACAGAGACGCGCCAACCCCCGTACCTCACAGTGCCAGCTTAATAAGAACGCCCCATTTAAGATGCATTTTTCTCACATTTGCCGCCATTCTGCCGGGCCTGGTGATACACTCTAGCGTTGCTATGACAACCAAGGACAAACAACACAAGCTCTGTCAAACTGTAGCATTAAATACGGAGTTGGAATCCATTCCTGGGCTTGAGGTCCAAGGAATGTATGACGAGACCGTGGAGGTTAACGCCAAAGAGATTGAAAGCCCTCCGACTTCCGACGAGGAGAGTATAAAGGCTATGTCAAAGTTCAAACCTCCCGTACGTCACACCGCCAGACATGACTTCGATTCAGACAGCGAGAGCGATTCCAGTCCTGACAAGTTTGATAATCTTACAGAAGAGGATGAGTCTCCAGAAAGAAAGTCCGATATTGAGGATGCGAAGAAAAAAGTAAAAAGCAGCAGCGATTTATCAAGACAGTCCAGCAATCAACATAACTCCTCGCACTCTGAATCAGATAGTGCTGACGTGGGTTTGGAACAAGTGGAAGCCAGCAGTGGGAAGAAACGTGGCGTGGTGATCCCGGCAGAGGGCGCGTATGATCCGAAGGCCTACGCGGACCTGAAGGTGCCGCCCGAGCTTGACAATGTGTACACCCCTCAGAAGATCGACATAGACTTCAAGCTCCATCCGTTCATCCCGGAGTACGTCCCCGCTGTGGGAGACGCGGACGCCTTCCTCAAGGTGACCACACCAGCCTCAGGGCTGAGAGGAAAAGCGTTAGCTGATAACGCCTTGGACTTCATTGACAACTTAGGTCTGACGGTGTTGGATGAACCCTCCGCGGACCAAAGCGATGCTGCGTTACTTCATCTCCAACTGAGAGCCATCTCCAAGACCACCAGCGCCAAGTCAACTGTGATGACACGAAAACTTGAGAACGCTGAGAACAACCCTCAAGCGCTGGATCGCTGGATACGTGATGTGAGTGCGCTTCACGCGGGCCGCTCCCGCGCTACGGTCGCGTACACACGGAAAATGCCAGATATAGACGATCTAATGGCAGAATGGCCGGACGCGATAGAGGATACCTTGAATGAGGTCGGCGTGCCCCCCGCCAGCGTCGACTGCTCGCTGTCACAATACGTCGACATAGTCTGTGCCGTGTTCGACAT
CCCCGTCCACGGCGACACTGTCAATGACAGGATACAAGCGCTTCATCTGCTTTTCAGTTTGTATTCGGCGGTAAAAAACTCGCAGTTGTTTGCGGAAAGAGAGAAGGAGAAGGGTATGGCCGGTTAG

Protein sequence:

>DPOGS204038-PA
MAVIDRLLSLSGNSSRLEEASSVPLPRHYSETDFALTQLRELLAEKEINVHNLRLESKELKERLANMETQVHSLENESKERLRKVNELNERLAEKNTINKSLAVVVEEYERTIASLIAGMEQDRKRNAEERKRIISERDEQTAHLASMEVSFRDLHSKYEKSKQIILNMKANEDKYKASLKTFEDNLLKMQNNYELLKQHATSKLNHANQELEKYNRSHEAEVLKLNAMIKRKELHITSLEESLSQKTKANEELTAICDELINKIIVLSNCDTLATHHRDAPTPVPHSASLIRTPHLRCIFLTFAAILPGLVIHSSVAMTTKDKQHKLCQTVALNTELESIPGLEVQGMYDETVEVNAKEIESPPTSDEESIKAMSKFKPPVRHTARHDFDSDSESDSSPDKFDNLTEEDESPERKSDIEDAKKKVKSSSDLSRQSSNQHNSSHSESDSADVGLEQVEASSGKKRGVVIPAEGAYDPKAYADLKVPPELDNVYTPQKIDIDFKLHPFIPEYVPAVGDADAFLKVTTPASGLRGKALADNALDFIDNLGLTVLDEPSADQSDAALLHLQLRAISKTTSAKSTVMTRKLENAENNPQALDRWIRDVSALHAGRSRATVAYTRKMPDIDDLMAEWPDAIEDTLNEVGVPPASVDCSLSQYVDIVCAVFDIPVHGDTVNDRIQALHLLFSLYSAVKNSQLFAEREKEKGMAG-