Monarch geneset OGS2.0

DPOGS211611
Transcript: DPOGS211611-TA, 1350 bp
Protein: DPOGS211611-PA, 449 aa
Genomic position: DPSCF300232 + 36345-39109
RNAseq coverage: 752x (Rank: top 17%)
Annotation
Heliconius: HMEL014311  0.0  82.99%
Bombyx: BGIBMGA008228-TA  0.0  82.97%
Drosophila: CG11779-PA  3e-146  60.70%
EBI UniRef50: UniRef50_Q9VDZ7  5e-144  60.70%  CG11779, isoform A n=21 Tax=Coelomata RepID=Q9VDZ7_DROME
NCBI RefSeq: XP_001599273.1  8e-163  67.35%  PREDICTED: similar to mitochondrial import inner membrane translocase subunit tim44, partial [Nasonia vitripennis]
NCBI nr blastp: gi|345485280  1e-162  61.23%  PREDICTED: mitochondrial import inner membrane translocase subunit TIM44-like isoform 2 [Nasonia vitripennis]
NCBI nr blastx: gi|345485280  2e-157  64.92%  PREDICTED: mitochondrial import inner membrane translocase subunit TIM44-like isoform 2 [Nasonia vitripennis]
Group
Gene Ontology: GO:0006886  2.1e-182  intracellular protein transport
GO:0005744  2.1e-182  mitochondrial inner membrane presequence translocase complex
GO:0015450  2.1e-182  P-P-bond-hydrolysis-driven protein transmembrane transporter activity
KEGG pathway 
InterPro domain: [22-449]  IPR017303  2.1e-182  Mitochondrial inner membrane translocase complex, subunit Tim44
[293-442]  IPR007379  1.5e-41  Membrane transporter, Tim44-related/Ribosomal protein L45
Orthology group: MCL12140  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211611-TA
ATGGTAAAATACAACCATCTGATTTCATTTCCAAAAGGTTATCATTCATGTCGACAGTTATTATGTAGTTCTGCACGCTGGAAGAGTGCTCTACCAATACATTTGAAAACACCCGTGTTAACAATTAAACCTGATCAGGGTGTAGTTCAGGAATGTCGTCAGTACTCAGCACGGAAAGGTTTCTTTTCCTCAATCCTGGAAAACATAAAAGAAGATATAGCTAAAAATAAAGAAATGAAAGAAAATATCAAAAAATTCAGGGAAGAGGCGCAGAAACTTGAAAACTCGGATGCATTACAAGCAGCTAGGAAAAAGTTTCTTGCTGTTGAATCAGAGGCATCGAAAGGTTCCGAGGTGTTGAAGGAAACCATTGAGGGTATTAAGGGGAAGGTGGAGCATGTTATAGAGGAAGCCAGTAAAACGGAGATAGCTAAAAGGGCGGGAAAAATAACAGAAGACATCTCAAAAACTGCAAAAGATGCTGCGGAGTCTTTAGCTGATAAGAGTCAAAAACTTGGTCAGACATCGGCCTTTAAAACAATATCTCAGGCTACGGAGGTAGTTAAAAACGAAATGGCTCCTAAAGGACTAGAGGGAAGAGTATACACATCACCGGCAACCTTACGGAAAAGACTCGAGGTTGCAGCCACCGAACGAACGTTTGAGGCGGACCCGAACGCTACAGGCCTGGAGTTACACAAAGATTCACGATTCTATCAACAGTGGCAGGACTTTAGAGACAACAACCAGTACGTCAACAAGGTACTGGATTGGAAAATAAAATATGAGGAATCAGAAAATCCAGTGTTTAAAGCGTCGAGGTTTGTGACGGAGAAGGTGAGCAGCCTGTTCGGGAACCTGTTCGAGAAGACGGAACTGTCACACACCCTGACCGAGATCTGCAAGATAGACCCCAACTTCACGGCGCAGAAGTTCTTAGAAGACTGCGCCAACGACATCATACCGAACATTCTGGAAGCCATGGTCCGGGGAGACATGGACATATTAAAGGACTGGTGCTACGAGGGCGTCTACAACATTCTCTCAGCGCCCATCAAGCAGTGCAGGCAGATGGGTTACAGGCTGGACTCCAAGATCCTGGACATCGAGAATATAGAGCTGGTCATGGGCAAGATGATGGACCAGGGTCCCGTGCTCGTCATCACGTTCCAGTCGCAGCAGATGATGTGCGTGAGGGACGCCAAGAACAACGTGGTGGAAGGAGACCCCAACAAGGTGATGAGGGTCAACTACGTGTGGGTTCTGTGCCGCGACCCCCAGGAGATGAACCCCAAGGCCGCCTGGAGACTGCTGGAGCTGTCCGCCAACAGCGTCGAGCAGCTGATATAG

Protein sequence:

>DPOGS211611-PA
MVKYNHLISFPKGYHSCRQLLCSSARWKSALPIHLKTPVLTIKPDQGVVQECRQYSARKGFFSSILENIKEDIAKNKEMKENIKKFREEAQKLENSDALQAARKKFLAVESEASKGSEVLKETIEGIKGKVEHVIEEASKTEIAKRAGKITEDISKTAKDAAESLADKSQKLGQTSAFKTISQATEVVKNEMAPKGLEGRVYTSPATLRKRLEVAATERTFEADPNATGLELHKDSRFYQQWQDFRDNNQYVNKVLDWKIKYEESENPVFKASRFVTEKVSSLFGNLFEKTELSHTLTEICKIDPNFTAQKFLEDCANDIIPNILEAMVRGDMDILKDWCYEGVYNILSAPIKQCRQMGYRLDSKILDIENIELVMGKMMDQGPVLVITFQSQQMMCVRDAKNNVVEGDPNKVMRVNYVWVLCRDPQEMNPKAAWRLLELSANSVEQLI-
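
Consistency check (illustrative sketch, not part of the original record): the listed transcript length (1350 bp) and protein length (449 aa) agree if the final codon is a stop, since 1350 / 3 = 450 codons = 449 residues + 1 stop. The Python below assumes the standard genetic code (NCBI translation table 1) and assumes the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the `stop_char` parameter are hypothetical helpers introduced here.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES),
    AMINO,
))


def translate(cds: str, stop_char: str = "-") -> str:
    """Translate a coding sequence in frame 0, codon by codon.

    Stop codons are written as `stop_char` (assumed here to be "-",
    matching the record's protein sequence). Trailing bases that do
    not form a full codon are ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_char if aa == "*" else aa)
    return "".join(residues)


# First eight and last five codons of DPOGS211611-TA, copied from above.
head = translate("ATGGTAAAATACAACCATCTGATT")
tail = translate("GAGCAGCTGATATAG")

# Length bookkeeping: 450 codons, of which the last is the stop.
expected_protein_length = 1350 // 3 - 1
```

Passing the full DPOGS211611-TA sequence to `translate` and comparing the result with DPOGS211611-PA would extend the same check to the whole record.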