Monarch geneset OGS2.0

DPOGS215387
Transcript: DPOGS215387-TA (942 bp)
Protein: DPOGS215387-PA (313 aa)
Genomic position: DPSCF300088, 403270-406371
RNAseq coverage: 172x (Rank: top 50%)
Annotation
Heliconius: HMEL009718 (E-value 1e-151, 78.91%)
Bombyx: BGIBMGA014083-TA (E-value 4e-149, 78.16%)
Drosophila: CG9393-PA (E-value 7e-47, 34.29%)
EBI UniRef50: UniRef50_UPI0002247D46 (E-value 5e-77, 44.26%) UPI0002247D46 related cluster n=1 Tax=unknown RepID=UPI0002247D46
NCBI RefSeq: XP_624291.2 (E-value 3e-78, 44.86%) PREDICTED: similar to metaxin 1 [Apis mellifera]
NCBI nr blastp: gi|383851731 (E-value 1e-79, 47.26%) PREDICTED: metaxin-1-like [Megachile rotundata]
NCBI nr blastx: gi|345497387 (E-value 1e-79, 44.59%) PREDICTED: LOW QUALITY PROTEIN: metaxin-1-like [Nasonia vitripennis]
Group:
Gene Ontology:
  GO:0006626 (7.2e-12) protein targeting to mitochondrion
  GO:0005741 (7.2e-12) mitochondrial outer membrane
KEGG pathway:
InterPro domain:
  [8-73] IPR019564 (7.2e-12) Mitochondrial outer membrane translocase complex, Tom37/Metaxin
  [152-235] IPR010987 (3.1e-08) Glutathione S-transferase, C-terminal-like
Orthology group: MCL14232, Single-copy universal gene

Nucleotide sequence:

>DPOGS215387-TA
ATGGCTAGCATTGAATTAGATATATGGAAAGGAGAATGGGGACTTGCGTCTATAGATTTAGAATGCCTTAAAGTTCTGACTTATATGAAGTTTATCGGATTACCAGTTAAAGTTCGGGAATCAAGCAATCCATTTTTCACGCCAAAAGGAGCCTTGCCAGTCATGAAAGATGGTAGAACTGTTCTTACAAATTTTGAAGAAGTAGTCCAATACCTTAAGTCATTGCATTATAGCACAGATGTTCATTTAAATTCAAACCAGTCGGCTGAAGCTAGTGCATTTACTCAATATTTAAGGGACAAGCTGTATCCAGCATATCAGTTTACTTGGTGGGTTGATGAAAAGAATTATGGAGATATAACACGACCCGCATATGCCAAAGCTCTAAAATTACCATTTAATTTTTACTACCCATCAAAATATCAAAAGGCAGCTAAAGATATGGTTGATGCATTGTACGGTGAAAACACTGATTTAAAGGAAATTGAAAAAACAATATACAATGAAGCAGAGAAGTGTCTTAAGACACTCTCAGATAGACTCGGTGAGAGTGAATATTTCTTTGGCAACCGGCCTTCTTCTTTTGATGCTATAGTGTTTGCTTACCTCGCACCCCTCATAAAGACGCCCTTCCCCAATGCAACTTTGTCTAGTCATGTGAAGGGTATAGCAAATTTAAGTAGATTTGTGGCTAGAATCAGCCAGAAAAATTTTAGATCTTTCGCGGATGAGTACAAGAAGTCATCGACTCGTGTGTCCGGCGTTCAGACCTCCGGCGAGGCTCAGTTCCCAAACGCCACTCGTAACAAGTTGTTGGCGGGACTGTTCGCGACACTGGCTATGACGGGATACGCCCTCGCCACAGGAATGTTCCAGGACATGAAAGATTACGAGCACTCACAAGAATACAACGACATGTTCGAAAACGAAGAAGACAATTGA

Protein sequence:

>DPOGS215387-PA
MASIELDIWKGEWGLASIDLECLKVLTYMKFIGLPVKVRESSNPFFTPKGALPVMKDGRTVLTNFEEVVQYLKSLHYSTDVHLNSNQSAEASAFTQYLRDKLYPAYQFTWWVDEKNYGDITRPAYAKALKLPFNFYYPSKYQKAAKDMVDALYGENTDLKEIEKTIYNEAEKCLKTLSDRLGESEYFFGNRPSSFDAIVFAYLAPLIKTPFPNATLSSHVKGIANLSRFVARISQKNFRSFADEYKKSSTRVSGVQTSGEAQFPNATRNKLLAGLFATLAMTGYALATGMFQDMKDYEHSQEYNDMFENEEDN-
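
The transcript and protein records above can be checked against each other: 942 bp is 314 codons, that is, 313 amino acids plus one stop codon, which the protein record writes as a trailing "-". The sketch below is one possible way to run that check. It assumes the standard genetic code (NCBI translation table 1); the function name `translate` and the `stop_symbol` parameter are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` so the output matches the
    trailing "-" used in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five and last three codons of DPOGS215387-TA, copied from the record.
print(translate("ATGGCTAGCATTGAA"))  # MASIE
print(translate("GACAATTGA"))        # DN-
```

Passing the full DPOGS215387-TA sequence to the same function should give a string to compare with the DPOGS215387-PA record, including the final stop symbol.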