Monarch geneset OGS2.0

DPOGS215635
TranscriptDPOGS215635-TA1086 bp
ProteinDPOGS215635-PA361 aa
Genomic positionDPSCF300041 - 1823513-1825069
RNAseq coverage119x (Rank: top 58%)
Annotation
HeliconiusHMEL0045195e-17278.39% 
BombyxBGIBMGA003535-TA4e-16276.88% 
DrosophilaTom40-PB3e-2023.86% 
EBI UniRef50UniRef50_E2A0E81e-2027.62%Mitochondrial import receptor subunit TOM40-like protein n=4 Tax=Formicidae RepID=E2A0E8_CAMFO
NCBI RefSeqXP_966771.12e-2929.66%PREDICTED: similar to mitochondrial import receptor subunit tom40 isoform 1 [Tribolium castaneum]
NCBI nr blastpgi|3323736902e-2929.82%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323736902e-2829.82%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00068207.7e-24anion transport
GO:00550857.7e-24transmembrane transport
GO:00083087.7e-24voltage-gated anion channel activity
GO:00057417.7e-24mitochondrial outer membrane
GO:00440707.7e-24regulation of anion transport
KEGG pathwaytca:6575427e-29 
 K11518 (TOM40)maps-> Amyotrophic lateral sclerosis (ALS)
InterPro domain[87-353] IPR0019257.7e-24Porin, eukaryotic type
Orthology groupMCL24920 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215635-TA
ATGAAAACAATCCACGCTAAATCTAATCCATGTTCTGGAACAAGTAGCCCTAGAAAAGAGAACGTTTCACCTTGTCCGCCGATCGCTGACAAAGCCGGAGGAGGCGACGCACCACCTCCAAAGAAATGTGCATGCAGAGTATCAGATAGCTCGAAATCTGCAAGTGATCGGGGAATTACTGGGGCTGGGGTGACTGGCTCAAGAACTGGAAGAATCTGTGACAGGGATGACGAGCCTCCACTACCCCTGGGAGCTACCAATCCTGGACTCCTTCGCCACATACATAACGCAGCACGACAACGTATACCACAATGTTTCGAGGGCGCCTGTGTGTCCATGAAACATAGTTCTACTAGCAACTGGGTTGTTGGCCATTCTATGTCGTTCAGTTCCGTTACTCCAGGAGGTTACAAGATTTTGTTGTCTTATGCTGATAAGAAAAGGTCCATCGGTTTGCCATATTTTGTAATGGAAGCAGCTCCGGGTGGTCAGATGAGTTGTGAGATACGTGTCGGTCCGACCTCAGGTACAAGAGCGACGGTCGTAGCCCAAGTAGCGGATGGAGAAATTTATAGCTTCGAGGGCATCTACGACGCGTACTTTAACAATTTCACCTCCTCCATTATTGCCGTCAATCGCGAATTCATTGCACTGCATTTCTTACAAGCTGTTACTGAACAAATTTCTCTTGGTGCGGAGGTTGTAGCCCGGAGCCATGCGGCTGAATTAAGTTCTGCTTCAGGGGCGGCTCGCTGGGCCGCTGAACACCATTCCGTTAGCGCGACACTCGGAAACCGTGGTCTTGACCTATGCTATGCGAGGAATATTAAACCATTCCTGACTGTTGCTGCTATGCTTGAGGTGGGATTTGCCATACGGCGAGCGGTGGCCACGCTGGCCTACGAGTGGCACACTGACCAGTGGACGGTGAGAGCCTCCGTGGACTCCGACGGACTCGTGGGCGCCACTCTACAAAGGGCGCTCGGCGGCAAGAGATCACATCTAGCTTGCGCTATATCAGCTCTCCTCAATCACCCGAACGACAAATTCAGACTGGGATTTGGTGTAACCGCGGCGATCATATAA

Protein sequence:

>DPOGS215635-PA
MKTIHAKSNPCSGTSSPRKENVSPCPPIADKAGGGDAPPPKKCACRVSDSSKSASDRGITGAGVTGSRTGRICDRDDEPPLPLGATNPGLLRHIHNAARQRIPQCFEGACVSMKHSSTSNWVVGHSMSFSSVTPGGYKILLSYADKKRSIGLPYFVMEAAPGGQMSCEIRVGPTSGTRATVVAQVADGEIYSFEGIYDAYFNNFTSSIIAVNREFIALHFLQAVTEQISLGAEVVARSHAAELSSASGAARWAAEHHSVSATLGNRGLDLCYARNIKPFLTVAAMLEVGFAIRRAVATLAYEWHTDQWTVRASVDSDGLVGATLQRALGGKRSHLACAISALLNHPNDKFRLGFGVTAAII-