Monarch geneset OGS2.0

DPOGS209283
TranscriptDPOGS209283-TA960 bp
ProteinDPOGS209283-PA319 aa
Genomic positionDPSCF300522 + 27771-33276
RNAseq coverage265x (Rank: top 40%)
Annotation
HeliconiusHMEL0176155e-9768.34% 
BombyxBGIBMGA001712-TA4e-10961.67% 
Drosophilaatt-ORFA-PA1e-9061.94% 
EBI UniRef50UniRef50_E2BFL65e-9159.25%Solute carrier family 25 member 42 n=1 Tax=Harpegnathos saltator RepID=E2BFL6_HARSA
NCBI RefSeqXP_311055.44e-9561.28%AGAP000097-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3838610973e-9355.70%PREDICTED: solute carrier family 25 member 42-like [Megachile rotundata]
NCBI nr blastxgi|1951589405e-9161.28%GL13935 [Drosophila persimilis]
Group
Gene OntologyGO:00068101.1e-34transport
GO:00550851.1e-34transmembrane transport
GO:00057431.1e-34mitochondrial inner membrane
GO:00054881.1e-34binding
KEGG pathwayptm:GSPATT000329930015e-32 
 K05863 (ANT)maps-> Huntington's disease
    Calcium signaling pathway
    Parkinson's disease
InterPro domain[31-308] IPR0233959.3e-77Mitochondrial carrier domain
[33-46] IPR0020671.1e-34Mitochondrial carrier protein
[126-213] IPR0181083e-23Mitochondrial substrate/solute carrier
Orthology groupMCL13320 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209283-TA
ATGGCGGTGGGAGAGGCCCGTGTGCCGCTTCTACATGAAGACAGACGCGGACACCATGAGCCCCGTCGCCTCACGGGCGCTTCTTTGGTTATAACTTCACTGGCGGCTGGTGCGGCGGCCGGAGCCCTGGCCAAAACGGCCATCGCTCCCTTGGACAGGACCAAGATCAACTTCCAAACCTCCGAGATACCGTATTCCTGGCGCGCTGCCGTCCGCTTTATAACCCACAGCGCCCGCACCGAGGGCGTAGCGGCTCTGTGGCGGGGAAACAGCGCGACTATGGCTCGCATTGTTCCCTACGCCGCTATACAGTTCACAGCGCACGAGCAATGGAAGACGTTGCTGAAGGTAGACTCGCCGGAGACAGCGCAAGGTTCTCCTCTCCGCCTGTTGTTGGCGGGTTCTCTGGCTGGCGTGACGTCACAGAGCGCGACCTACCCTCTAGACCTGGCTCGGGCTCGGATGGCCGTCAGTTCGTCCAGGGAGTACACCTCGCTGCGGCAGGTCTTCGTGAGAGTCATCAGGGAAGAGGGCCTCCGAACGCTTTACAGAGGTTACCCGGCGACTGTCCTCGGTGTGGTCCCGTACGCGGGCGTGTCGTTTTTCACGTTTGATTCCTTGAGGCACTGGTACCTGGACCGCCACGGCGTGTCTCCGAGCGGCGTGACCAACATGTTGTTCGGTGGCGTCGCGGGTGCTCTCGCCCAAACCGCTTCCTATCCTCTGGACATAGTGAGGAGACGAATGCAGACGGCTCACCGGCGCCCTGACGCCTCTTACCCCTACCCCACGATACTGGCCACGCTCGCCTCCGTCCACAGGTTGGAAGGTTGGCGCGGGTTTTTCAAAGGTCTGAGCATGAACTGGATCAAGGGACCCATAGCCGTGGGCATCTCGTTCGCCACTTACGACGCCATCAAATCCACGTTGAGAGACATCTCGCTCACGCTAGTTACTTAA

Protein sequence:

>DPOGS209283-PA
MAVGEARVPLLHEDRRGHHEPRRLTGASLVITSLAAGAAAGALAKTAIAPLDRTKINFQTSEIPYSWRAAVRFITHSARTEGVAALWRGNSATMARIVPYAAIQFTAHEQWKTLLKVDSPETAQGSPLRLLLAGSLAGVTSQSATYPLDLARARMAVSSSREYTSLRQVFVRVIREEGLRTLYRGYPATVLGVVPYAGVSFFTFDSLRHWYLDRHGVSPSGVTNMLFGGVAGALAQTASYPLDIVRRRMQTAHRRPDASYPYPTILATLASVHRLEGWRGFFKGLSMNWIKGPIAVGISFATYDAIKSTLRDISLTLVT-