Monarch geneset OGS2.0

DPOGS209514
Transcript: DPOGS209514-TA (963 bp)
Protein: DPOGS209514-PA (320 aa)
Genomic position: DPSCF300127 + 232207-233169
RNAseq coverage: 142x (Rank: top 55%)
Annotation
Heliconius: HMEL016028, 4e-135, 70.59%
Bombyx: BGIBMGA007429-TA, 9e-109, 57.37%
Drosophila: no hit listed
EBI UniRef50: UniRef50_Q18090, 3e-08, 21.46%, Mitochondrial import receptor subunit TOM40 homolog n=8 Tax=Chromadorea RepID=TOM40_CAEEL
NCBI RefSeq: XP_003116745.1, 4e-09, 21.86%, CRE-TOMM-40 protein [Caenorhabditis remanei]
NCBI nr blastp: gi|332373690, 1e-11, 21.95%, unknown [Dendroctonus ponderosae]
NCBI nr blastx: gi|332373690, 3e-11, 22.27%, unknown [Dendroctonus ponderosae]
Group
Gene Ontology:
 GO:0006820, 8.7e-13, anion transport
 GO:0055085, 8.7e-13, transmembrane transport
 GO:0008308, 8.7e-13, voltage-gated anion channel activity
 GO:0005741, 8.7e-13, mitochondrial outer membrane
 GO:0044070, 8.7e-13, regulation of anion transport
KEGG pathway: cbr:CBG03225, 1e-08
 K11518 (TOM40) maps-> Amyotrophic lateral sclerosis (ALS)
InterPro domain: [43-282] IPR001925, 8.7e-13, Porin, eukaryotic type
Orthology group: MCL25349, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209514-TA
ATGGAAATAAACGAATCCGTTATTAAGGAGGAGGTTGGAAATTTTCTAAATAGCCTGCAGAAGTTATTACCGAAGAAAAAGGACTTCATAGTTATACGGAATGACAAACCCACTTTGACAAGACTAAGTGATGTTCACGCGGAGGCAAAGAAAGTATTTCCACAATGTTTTGTTGGTGCCAAAATGGTTATTATGAGGGATGTACTTGAAAAAGTGAAATTGGTACAAAATTATAACTTAGGGAAATCGAAAGACTCTTACAAATGTTTCTCACAACTTATACATAAGGAAATGGATGAAAAGAATGTCGCAGACGGTTTGTTGGTTGACTCGGCTGGGTCAGCAACAGCAACCTATACTGAAAAAATGAACGATTACGAGATGAAACTGACATCCAAGATAAAAGATTTAGTGTCATCAGAAACAGAGATATCGTTTGAAACCGATAGTAAAAAATCAATAAGGTCCATGTCATTTGCGTTAAAGGATGCAGATCCAAATACACTTAAGGTTGTCACCCAGTGGATGTACAAAGTCTCGCCAGAGTTTTGTGTTGGCAGCGAAATGGGATTTAAACTGTTATCGTACCCTTTGTCTCCAGAACTATCGATCAGTGCGAGATACGACAAGCCTGCGTTTACTCTGTCATCAACTATCAGTCGGGCCGGTTACCAAGTGTGTTTATTCAAACCTTTCACGTCAGATTTGCGAATAGCCACGATTATTAACGAGAACAATCGAGGTGGTTCCGCCACAATAGGTCTCGCGTTGCACAAAAGCTACGAAAATTCTGAATTGAAAATTTTCGTTGACTCGCAACGTTGCGGAGGCTTCACTTTTGAAAAGGATGTTTTGTTCAAAGAGCAGCAAAACGATGTAAGAGTTATTCGATTAATGGTCAGCACTATCATCGATCGCCAGAAGCGAGTCCGATGCGGTTTCGGTTTCAATCTAGACTTCTAA

Protein sequence:

>DPOGS209514-PA
MEINESVIKEEVGNFLNSLQKLLPKKKDFIVIRNDKPTLTRLSDVHAEAKKVFPQCFVGAKMVIMRDVLEKVKLVQNYNLGKSKDSYKCFSQLIHKEMDEKNVADGLLVDSAGSATATYTEKMNDYEMKLTSKIKDLVSSETEISFETDSKKSIRSMSFALKDADPNTLKVVTQWMYKVSPEFCVGSEMGFKLLSYPLSPELSISARYDKPAFTLSSTISRAGYQVCLFKPFTSDLRIATIINENNRGGSATIGLALHKSYENSELKIFVDSQRCGGFTFEKDVLFKEQQNDVRVIRLMVSTIIDRQKRVRCGFGFNLDF-
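
Consistency check:

The lengths reported in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the geneset. It assumes the genomic coordinates are 1-based and inclusive, and that the trailing "-" in the protein sequence marks the stop codon (the transcript ends in TAA). All variable names are hypothetical.

```python
# Values copied from the record; variable names are illustrative.
transcript_bp = 963                 # length of DPOGS209514-TA
protein_aa = 320                    # length of DPOGS209514-PA
start, end = 232207, 233169         # position on DPSCF300127, assumed inclusive

# Length of the genomic interval under the inclusive-coordinate assumption.
span_bp = end - start + 1

# Number of codons in the transcript, including the terminal stop codon.
codons = transcript_bp // 3

# The transcript should be a whole number of codons, and the coding
# codons (all but the stop) should match the reported protein length.
in_frame = transcript_bp % 3 == 0
matches_protein = codons - 1 == protein_aa

# If the genomic span equals the transcript length, the gene model
# contains no introns within the annotated interval.
no_introns = span_bp == transcript_bp

print(span_bp, codons, in_frame, matches_protein, no_introns)
```

Under these assumptions the genomic span and the transcript are both 963 bp, which is what a single-exon gene model would give, and 321 codons account for 320 residues plus the stop.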