Monarch geneset OGS2.0

DPOGS203770
Transcript         DPOGS203770-TA   942 bp
Protein            DPOGS203770-PA   313 aa
Genomic position   DPSCF300010 + 618986-622331
RNAseq coverage    737x (Rank: top 18%)
Annotation
Heliconius       HMEL004227         2e-167   88.57%
Bombyx           BGIBMGA013348-TA   7e-156   82.54%
Drosophila       Tom40-PB           8e-92    55.40%
EBI UniRef50     UniRef50_Q9U4L6    1e-89    55.40%   Mitochondrial import receptor subunit TOM40 homolog 1 n=24 Tax=Coelomata RepID=TO401_DROME
NCBI RefSeq      XP_317621.2        3e-101   59.93%   AGAP007871-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|58390289        6e-100   59.93%   AGAP007871-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|58390289        3e-95    59.93%   AGAP007871-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology     GO:0006820   1.7e-57   anion transport
                  GO:0055085   1.7e-57   transmembrane transport
                  GO:0008308   1.7e-57   voltage-gated anion channel activity
                  GO:0005741   1.7e-57   mitochondrial outer membrane
                  GO:0044070   1.7e-57   regulation of anion transport
KEGG pathway      aga:AgaP_AGAP007871   1e-100
                  K11518 (TOM40) maps-> Amyotrophic lateral sclerosis (ALS)
InterPro domain   [31-307] IPR001925   1.7e-57   Porin, eukaryotic type
Orthology group   MCL14186   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203770-TA
ATGGGAGCAATAGCCAGCAGGTACCGTGTATCAAACGCGGGTCTCACTAACGCGGACATACAGAATATAGAAAATGCTAACAAACGTGAAAACCCGGGAACTTTGGATGAACTCCATAAAAAAACTAAAGATGTATTGCCAGTTAACTTTGAAGGCGAAATTAAAATCCATAAAACATTGAACTATCAACAGATGTCCCACACCTTAACAATGAGCTCAATGCAAAATGGTTACAAGCTGGGTGCAACATATATTGGTACAAAGCAAATATCGCCAACCGAGGCATTTCCCGTTGTGCTGGGTGATGTGGATCCCGCCGGAAATGTTAACTTCAATCTTATACATCAACTAACCCAAGACGTCAGAGTTAAAGTTGCTGCACAGGTTCAAGAATGCAAGTTGTCAGCGACCCAGGGCACAGTTGAATACAAGGGTTCGGATTATACTCTAGCCCTGGCCGTCGGTAAACCTGACTGTAGTGATAAATCATCTGTGTTTGTTGGACATTACTTGCAGTCAGTCAGTAAGCATCTAACCCTGGGTGCGGAGCTAGTGTATCAGAGCAGCGCACGCATAGCCGGAGGAGAGGTGGCCATCGCCTCGGCCGCAGCAAGATATACTATGGATGATTCTGAGGTATCGGCAACGCTGGGAGCTGCTAGCTTCCATGTCTGCTACTTCAAACAAGCTAGTGAACAATTACAGGTGGTTGCTGAAATGGAGACTTCATTCAGGGGTATGGAATCAACAGGCACTATTGGATACCAAGTGACCATACCTAAAGCGGAGCTTGTCTTTAGAGGCATGGTAGACTCAAATTGGAACATCGGGGCCGTTCTGGAGAAGAAACTCCAGCCGTTGCCGTTTACGTTTGCTCTGTCAGGTATGAGCAACCAGGCCAAACAGCAGTTCAAGTTTGGTTGTGGTCTCATCATCGGATAG

Protein sequence:

>DPOGS203770-PA
MGAIASRYRVSNAGLTNADIQNIENANKRENPGTLDELHKKTKDVLPVNFEGEIKIHKTLNYQQMSHTLTMSSMQNGYKLGATYIGTKQISPTEAFPVVLGDVDPAGNVNFNLIHQLTQDVRVKVAAQVQECKLSATQGTVEYKGSDYTLALAVGKPDCSDKSSVFVGHYLQSVSKHLTLGAELVYQSSARIAGGEVAIASAAARYTMDDSEVSATLGAASFHVCYFKQASEQLQVVAEMETSFRGMESTGTIGYQVTIPKAELVFRGMVDSNWNIGAVLEKKLQPLPFTFALSGMSNQAKQQFKFGCGLIIG-
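
The two sequences above can be checked against each other by translating the transcript. The following is a minimal Python sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, in the conventional T, C, A, G codon ordering.
# "*" marks a stop codon in this table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" by default, which is an
    assumption about how the protein record above marks its terminator).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nucleotides of DPOGS203770-TA, copied from the record above.
print(translate("ATGGGAGCAATAGCCAGCAGGTACCGTGTA"))  # MGAIASRYRV

# Length check: 313 residues plus one stop codon, three bases each.
print(3 * (313 + 1))  # 942
```

Passing the full DPOGS203770-TA sequence to `translate` and comparing the result with the DPOGS203770-PA record would extend the same check to the whole gene model.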