Monarch geneset OGS2.0

DPOGS201400
Transcript: DPOGS201400-TA, 1002 bp
Protein: DPOGS201400-PA, 333 aa
Genomic position: DPSCF300083 + 570292-576471
RNAseq coverage: 312x (Rank: top 36%)
Annotation
Heliconius       HMEL012852        3e-149  86.60%
Bombyx           BGIBMGA002002-TA  1e-160  90.64%
Drosophila       CG32103-PB        2e-130  66.07%
EBI UniRef50     UniRef50_Q9VTX3   8e-133  66.76%  CG32103, isoform C n=24 Tax=Coelomata RepID=Q9VTX3_DROME
NCBI RefSeq      XP_001972530.1    2e-135  67.35%  GG13834 [Drosophila erecta]
NCBI nr blastp   gi|194869832      5e-134  67.35%  GG13834 [Drosophila erecta]
NCBI nr blastx   gi|270012323      1e-128  71.47%  hypothetical protein TcasGA2_TC006460 [Tribolium castaneum]
Group
Gene Ontology
  GO:0006810  9.8e-33  transport
  GO:0055085  9.8e-33  transmembrane transport
  GO:0005743  9.8e-33  mitochondrial inner membrane
  GO:0005488  9.8e-33  binding
KEGG pathway
  tps:THAPSDRAFT_39143  2e-33
  K05863 (ANT) maps to:
    Huntington's disease
    Calcium signaling pathway
    Parkinson's disease
InterPro domain
  [52-326]   IPR023395  9.3e-86  Mitochondrial carrier domain
  [56-69]    IPR002067  9.8e-33  Mitochondrial carrier protein
  [142-231]  IPR018108  1.8e-27  Mitochondrial substrate/solute carrier
  [48-68]    IPR002167  1.3e-07  Graves disease carrier protein
Orthology group: MCL14266, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201400-TA
ATGTCTCTGGAGTTGGAAGAAGTTACGGGACAGACGGGGGATGTGTTCGAGGATTTCCTCTTCGTTTTTCGTCGTTGTTTGGCAAAGTATCTCGACATCGGGGAGGATATGAACGTACCTGATGACTTCACTCCAACTGAACTCCAGACAGGGAAGTGGTGGCGACATTTGTTAGCTGGCGGTATCGCCGGGGCTGTCAGTAGGACTTGTACAGCTCCACTAGACAGACTTAAAGTATTTTTACAGGTTAATCCAACAAGAGAAAATATGGCCAAATGCTTAGCAAAAATGATCAACGAAGGAGGCATTGGTGGCCTCTGGAGAGGTAATGGAATAAATGTCATAAAAATAGCTCCTGAGTCAGCTTTGAAATTCGCTGCCTACGAGCAAGTAAAACGTCTCATAAAAGGAGAAAAAAATCCATTAGAAATATACGAAAGGTTCCTCGCTGGAGCCTCGGCTGGAGCCATAAGTCAAACTGTGATTTACCCACTAGAGGTGCTAAAGACAAGATTAGCTTTAAGAAAAACTGGCCAATATAGCGGTATCGTGGATGCGGCGAAGAAGATATACGCTAGGGAGGGATTGAAGTGCTTCTACAAGGGATACATTCCAAATATCCTAGGGATAGTGCCTTACGCTGGTATAGACCTCGCTGTTTACGAGACGCTTAAGAAGAAATACATTAATAAATACCAGACTAACAATGAACAACCGGGTATGTTGCTGTTACTGGCTTGTGGGAGCACGTCTTGTACACTGGGGCAAGTGTGTTCATACCCGTTAGCCCTCGTTCGGACAAGGTTGCAAGCACAAGAGAAAGCTGCTAAAGGCGCAGAAGGTACAATGCGAGGCGCCTTCCGTGAGATCGTTCAGCGCGAGGGTCTACGTGGGCTGTACCGCGGCATCACTCCAAATTTCATCAAAGTTATCCCAGCTGTCTCTATCTCATATGTTGTGTACGAGTACGCGAGCCGTTCGCTTGGTGTCAACATGACGTGA

Protein sequence:

>DPOGS201400-PA
MSLELEEVTGQTGDVFEDFLFVFRRCLAKYLDIGEDMNVPDDFTPTELQTGKWWRHLLAGGIAGAVSRTCTAPLDRLKVFLQVNPTRENMAKCLAKMINEGGIGGLWRGNGINVIKIAPESALKFAAYEQVKRLIKGEKNPLEIYERFLAGASAGAISQTVIYPLEVLKTRLALRKTGQYSGIVDAAKKIYAREGLKCFYKGYIPNILGIVPYAGIDLAVYETLKKKYINKYQTNNEQPGMLLLLACGSTSCTLGQVCSYPLALVRTRLQAQEKAAKGAEGTMRGAFREIVQREGLRGLYRGITPNFIKVIPAVSISYVVYEYASRSLGVNMT-
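
The two sequences above can be cross-checked against each other. The following is a minimal Python sketch, not part of the database record. It assumes the transcript is a plain coding sequence with no UTRs, that it is translated with the standard genetic code, and that the trailing "-" in the protein marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers introduced here for illustration.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` so the output follows the
    convention used in the protein record above (assumed, not documented).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript holds one codon per residue plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)
```

For this record, `lengths_consistent(1002, 333)` holds because 3 x (333 + 1) = 1002. Passing the first ten codons of DPOGS201400-TA to `translate` gives the first ten residues of DPOGS201400-PA, and the last four codons (AAC ATG ACG TGA) give the closing "NMT-".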