Monarch geneset OGS2.0

DPOGS204697
Transcript: DPOGS204697-TA (771 bp)
Protein: DPOGS204697-PA (256 aa)
Genomic position: DPSCF300170 + 484044-501880
RNAseq coverage: 424x (Rank: top 29%)
Annotation
Heliconius: HMEL008245, 5e-97, 88.38%
Bombyx: BGIBMGA007476-TA, 1e-104, 89.90%
Drosophila: att-ORFA-PA, 4e-36, 32.76%
EBI UniRef50: UniRef50_E3XCJ8, 2e-79, 76.12%, Putative uncharacterized protein n=3 Tax=Pancrustacea RepID=E3XCJ8_ANODA
NCBI RefSeq: XP_309341.4, 2e-92, 63.86%, AGAP011308-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158287268, 3e-91, 63.86%, AGAP011308-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|158287268, 9e-95, 63.86%, AGAP011308-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0006810, 1.3e-28, transport
    GO:0055085, 1.3e-28, transmembrane transport
    GO:0005743, 1.3e-28, mitochondrial inner membrane
    GO:0005488, 1.3e-28, binding
KEGG pathway: ptm:GSPATT00032993001, 2e-24
    K05863 (ANT) maps-> Huntington's disease
    Calcium signaling pathway
    Parkinson's disease
InterPro domain: [9-243] IPR023395, 2.7e-57, Mitochondrial carrier domain
    [17-30] IPR002067, 1.3e-28, Mitochondrial carrier protein
    [14-100] IPR018108, 2e-24, Mitochondrial substrate/solute carrier
    [9-29] IPR002167, 1.3e-16, Graves disease carrier protein
Orthology group: MCL15745, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204697-TA
ATGGCACTTACTATGGATCAGAAGAATAACTTGGACTTTATTTTAAAGAACTTACTCGCTGGTGGTGTTGCGGGCATGTGCGCCAAGACTACAGTAGCTCCGTTAGATCGCATAAAGATTCTCCTCCAAGCTCAGTCTTCACATTACAAACACCATGGAGTGTTTGGTGGACTTATGGCTATCGTGAAACAGGAGTCACTCATAGCTCTATACAAAGGAAACGGTGCTCAGATGGTCAGAATATTCCCCTATGCGGCGACACAGTTCACAAGCTTTGAAATATATAAACGGTATCTCAGCGGGTTAAGCGTGCCTCTAGTTCAGCATGGTGATAAGTTCGTGGCTGGTGCTGGCGCCGGTGTGACCGCGGTGACCTTGACTTACCCCCTGGATACAATAAGAGCTCGTCTGGCGTTCCAAGTGACTGGCGAGCACAGGTACAACGGCATCGTAGACGCCGCTACAACCATATTTAAGACGGAGGGCGGTATACTAGCTCTATACCGTGGTTTCGTTCCCACTATGGTCGGCATGGTTCCGTACGCTGGTTTCAGTTTCTACTGCTTCGAGTCCCTCAAATTCCTCTGCATGAAAACGGGCATGATTAAGACCCTGACTCTCATTTACCGTGAGAACGGTATAGTGAAGGGCTTGTATCGAGGGATGACAGTTAACTACATCCGCGCCATACCGATGGTGGCGACGTCTTTCTCCACGTACGAGCTGATGAAGCAGTTACTCCACCTCGATACTGGCATGAAAATATCATAA

Protein sequence:

>DPOGS204697-PA
MALTMDQKNNLDFILKNLLAGGVAGMCAKTTVAPLDRIKILLQAQSSHYKHHGVFGGLMAIVKQESLIALYKGNGAQMVRIFPYAATQFTSFEIYKRYLSGLSVPLVQHGDKFVAGAGAGVTAVTLTYPLDTIRARLAFQVTGEHRYNGIVDAATTIFKTEGGILALYRGFVPTMVGMVPYAGFSFYCFESLKFLCMKTGMIKTLTLIYRENGIVKGLYRGMTVNYIRAIPMVATSFSTYELMKQLLHLDTGMKIS-
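
Length check:

The reported transcript and protein lengths can be cross-checked with simple arithmetic. The sketch below assumes the 771 bp transcript is coding sequence only (it begins with ATG and ends with TAA, with no untranslated regions) and that the trailing "-" in the protein sequence marks the stop codon. The function names are hypothetical and used only for illustration.

```python
def strip_stop(protein: str) -> str:
    """Remove a trailing stop marker ('-' or '*') from a protein string."""
    return protein.rstrip("-*")


def cds_matches_protein(cds_len: int, protein_len: int) -> bool:
    """Return True if a CDS of cds_len bases encodes protein_len residues
    plus one stop codon (assumes no UTRs and no partial codons)."""
    return cds_len % 3 == 0 and cds_len // 3 == protein_len + 1


# Lengths reported in this record: 771 bp transcript, 256 aa protein.
# 771 / 3 = 257 codons = 256 residues + 1 stop codon.
print(cds_matches_protein(771, 256))
```

Under these assumptions the two lengths agree: 257 codons account for 256 residues plus the terminal stop.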