Monarch geneset OGS2.0

DPOGS206894
Transcript: DPOGS206894-TA (2637 bp)
Protein: DPOGS206894-PA (878 aa)
Genomic position: DPSCF300001 - 1818188-1823119
RNAseq coverage: 3261x (Rank: top 4%)
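
The transcript and protein lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the helper names are hypothetical, and it assumes that the transcript length covers the coding sequence plus one stop codon and that the genomic coordinates are 1-based and inclusive.

```python
# Figures copied from the record; the helper names are illustrative only.
TRANSCRIPT_BP = 2637
PROTEIN_AA = 878
SPAN_START, SPAN_END = 1818188, 1823119


def cds_length(protein_aa: int) -> int:
    """Coding length in bp for protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


def span_length(start: int, end: int) -> int:
    """Interval length, assuming 1-based inclusive coordinates."""
    return end - start + 1


print(cds_length(PROTEIN_AA) == TRANSCRIPT_BP)  # True
print(span_length(SPAN_START, SPAN_END))        # 4932
```

Under these assumptions, 878 residues plus a stop codon account for exactly 2637 bp. The genomic span comes out longer than the transcript, and the difference would presumably be intronic sequence.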
Annotation
Heliconius       HMEL006862        0.0  93.98%
Bombyx           BGIBMGA012851-TA  0.0  88.09%
Drosophila       eIF3-S8-PA        0.0  60.79%
EBI UniRef50     UniRef50_Q0ZB76   0.0  87.98%  Eukaryotic translation initiation factor 3 subunit C n=29 Tax=Coelomata RepID=EIF3C_BOMMO
NCBI RefSeq      NP_001037658.1    0.0  87.98%  eukaryotic translation initiation factor 3 subunit C [Bombyx mori]
NCBI nr blastp   gi|112983228      0.0  87.98%  eukaryotic translation initiation factor 3 subunit C [Bombyx mori]
NCBI nr blastx   gi|112983228      0.0  89.67%  eukaryotic translation initiation factor 3 subunit C [Bombyx mori]
Group
Gene Ontology    GO:0006413  3.2e-211  translational initiation
                 GO:0003743  3.2e-211  translation initiation factor activity
                 GO:0005852  3.2e-211  eukaryotic translation initiation factor 3 complex
                 GO:0005515  4.7e-12   protein binding
KEGG pathway 
InterPro domain  [32-665]   IPR008905  3.2e-211  Eukaryotic translation initiation factor 3 subunit 8, N-terminal
                 [738-826]  IPR000717  4.7e-12   Proteasome component (PCI) domain
                 [743-809]  IPR011991  4.3e-09   Winged helix-turn-helix transcription repressor DNA-binding
Orthology group  MCL12025  Single-copy universal gene

Nucleotide sequence:

>DPOGS206894-TA
ATGAGTCGGTTCTTCGCAACTGGAACTGACTCCGAGTCTGAAAGTTCATCTGAAGAGGAACAGGTAGTAAGAGCGCCGGCGCCCGTTTACACGTTCAGTGATGATGAAGAAGAAACTAAACGTGTTGTACGTTCTATGAAAGAGAAAAGATATGAAGAATTGGAAGGAATAATTCATTCGATTCGTAACCATCGTAAGATCAAGGACTTCGCGTCAGCTTTAGCTTCATTTGAAGAGCTCCAAAAGGCTTACACTCGAGCAGCACCTGTTGTCGCTAAGGAGGAAAACGGTGTCGCTCCACGTTTCTTTATTAGAGCGTTGACAGAACTAGATGATTGGGTCTCGGGAGCATGGAATGATCGAGACAATAGGAAAACTCTTTCCAAGGGCAACAGTAAAGCACTGACGTCCTTGAGACAGAAACTTCGGAAGTATATAAAGGAGTTTGATGCCGAAATTTCCAAATTCCGTGAAAATCCAGACTTACCAGACGATGACGACGAGCGCAAAGATACGTCTTCATCTGACGAATCCGAAGATGAAGAAAAGGTTAAAGAAAAACCAAGAGTTCGCCGTTCACCGGAGCCCCAGCGTGGGCCGCCGCCTCAAGATGATGACTCATCCGACACTTTCGACTGGGGTTCTAGTTCGTCTGATTCGAGCTCAAGTTCCGACGATGAGACAAGGGCTGCAACAATACGAGAGAAATTCTTAAAGAAGACCACGGAAAGGGATGACGATGAGGAGAGAGACAGGCGCCGTGCTAGGCGTGAGAGACGAGAAAGGGTTGGGAAAATTAGCAAGAAGGATCAAGCTGATGACGGTGGAGAATGGGAGACAGTAAGGAAGGGCGCCGCGACATCAGATAAACCAAAAATGTTCGCTAAGGACAGTGACATTGATGCAGCGTTAGTAGTGAAGAAACTTGGCGAAATCAGTGCAGCTCGTGGTCGGAAAAGAACAGACCGAAGGGCACAACTGGAACTACTTCATGAACTCCGAACTGTGGCACAACAGCACAACCTCGGTGATGCTCTCCAGCTCAAACTGCGCGCTGCAACTGTTGCGTCACTTTTCGATTACAACCCCAAGGTCTCTGATGCCATGAAACCCGAATACTGGTCGCGCCTTGTAGAGAATGTGGATCAAATGGTCACTCTCCTTCTAGCCCATGAGGATATGATGCTAAGTGAAACAATCACCGAGGAGAATGAACAGCTAGTTACTCCACCATTTAAAGTCAGGGGCTGTCTGCTGACCGCATTGGAGAGACTTGACGACGAGTTCATAAAGCTGCTTAAAGAATGCGACCCTCATTCCAACGACTACGTAGAGAGATTGAAAGATGAAGTAAGGGTGTCCGCTCTTATCGATCGAGTCTGCCAGGTCGTAGAACGAGATGGAAGCCCTCAGGAAATCTGTCGCGCATATTTACGTAAAATTGATCATCTTTACTATAAATTTGATCCTCGTGCTATCAAGAAAGATTTGTCACCAGGTGAAGAAACAACTATTAAAAAGATGGAACGTCTATGCAAGTATATTTATGCTAACGATGATTCCGACCGTCTGAGAACTCGAGCGATACTATCGCATATATACCATCACGCCTTGCACGACAACTGGTTCCAGGCGAGGGATCTTCTTCTCATGTCCCATCTTCAAGAGAACGTTCAACACTCCGATCCCAGCACTCAGATATTGTACAACCGTACAATGGCTAATCTTGGGTTATGTGCTTTCCGTCGGGGCAATGTCAAGGAAGCCCATGGTTGTTTGGCCGAACTCATGATGACGGGCAAACCGAAAGAGTTACTCGCTCAGGGCTTGCTGCCACAGCGTCAACATGAGCGTTCTAAGGAACAGGAGAAAATAGAGAAGCAGCGCCAAATGCCGTTCCATATGCACATTAACTTGGAACTTTTGGAGTGCGTCTACTTGGTATCGGCCATGCTGATTGAAATTCCATACATGGCTGCTCATGAATTTGATGCCCGTCG
TCGTATGATAAGTAAGACATTCTATCAGAACTTGCGTGCGAGTGAAAGGCAGGCTTTGGTTGGTCCGCCGGAGTCTATGCGTGAACACGCTGTGGCAGCCGCGAGGGCAATGAGGAGGGGAGACTGGCGCGCCTGCCTTAACTTTATCGTAAACGAAAAAATGAACGCAAAGGTCTGGGATTTGATGGTCGGCGCTGAAAACGTGCGTGCTATGCTCGGTAGACTGATAAGGGAGGAGTCACTGAGAACATATTTGTTTACGTATGCTCATGTGTACGCATCGCTGTCATTGCATTCGCTGGCTGACATGTTCGAATTGCCGAGACAACGCGTTCACTCTCTCGTATCTAAGATGATCATTAACGAGGAACTGTTGGCTTCCCTGGACGATCCGAGCGAATGCGCCATACTCCACCGCTCTGAACCAACCAGAATGCAAGCGCTGGCGCTGCAACTCGCAGATAAGGTGGGTAACCTAGTAGACTCTAACGAGCGGATCTTCGAGAAGCAGGGCTCGTTCTTCCAGCGCGGTGGCGCGCCGCGGGGGGAAGGCAGACAGCGGGACAGACCACGCGAGGGCTGGGGCCGCAGGCAGCGCAACAGGCGTCGAGACGACGAGCGCGCTCACGAAGACTAA

Protein sequence:

>DPOGS206894-PA
MSRFFATGTDSESESSSEEEQVVRAPAPVYTFSDDEEETKRVVRSMKEKRYEELEGIIHSIRNHRKIKDFASALASFEELQKAYTRAAPVVAKEENGVAPRFFIRALTELDDWVSGAWNDRDNRKTLSKGNSKALTSLRQKLRKYIKEFDAEISKFRENPDLPDDDDERKDTSSSDESEDEEKVKEKPRVRRSPEPQRGPPPQDDDSSDTFDWGSSSSDSSSSSDDETRAATIREKFLKKTTERDDDEERDRRRARRERRERVGKISKKDQADDGGEWETVRKGAATSDKPKMFAKDSDIDAALVVKKLGEISAARGRKRTDRRAQLELLHELRTVAQQHNLGDALQLKLRAATVASLFDYNPKVSDAMKPEYWSRLVENVDQMVTLLLAHEDMMLSETITEENEQLVTPPFKVRGCLLTALERLDDEFIKLLKECDPHSNDYVERLKDEVRVSALIDRVCQVVERDGSPQEICRAYLRKIDHLYYKFDPRAIKKDLSPGEETTIKKMERLCKYIYANDDSDRLRTRAILSHIYHHALHDNWFQARDLLLMSHLQENVQHSDPSTQILYNRTMANLGLCAFRRGNVKEAHGCLAELMMTGKPKELLAQGLLPQRQHERSKEQEKIEKQRQMPFHMHINLELLECVYLVSAMLIEIPYMAAHEFDARRRMISKTFYQNLRASERQALVGPPESMREHAVAAARAMRRGDWRACLNFIVNEKMNAKVWDLMVGAENVRAMLGRLIREESLRTYLFTYAHVYASLSLHSLADMFELPRQRVHSLVSKMIINEELLASLDDPSECAILHRSEPTRMQALALQLADKVGNLVDSNERIFEKQGSFFQRGGAPRGEGRQRDRPREGWGRRQRNRRRDDERAHED-
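
The nucleotide and protein sequences can be spot-checked against each other by translating a few codons. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this transcript. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool. The translation marks a stop codon with `*`, whereas the protein record ends with `-`.

```python
BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), with codons in the
# order TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 9 nucleotides of DPOGS206894-TA, copied from the record.
first_30 = "ATGAGTCGGTTCTTCGCAACTGGAACTGAC"
last_9 = "GAAGACTAA"

print(translate(first_30))  # MSRFFATGTD
print(translate(last_9))    # ED*
```

Both results agree with the start (`MSRFFATGTD`) and end (`ED` followed by the stop marker) of DPOGS206894-PA.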