Monarch geneset OGS2.0

DPOGS202142
TranscriptDPOGS202142-TA1212 bp
ProteinDPOGS202142-PA403 aa
Genomic positionDPSCF300193 + 60598-62541
RNAseq coverage558x (Rank: top 23%)
Annotation
HeliconiusHMEL0146240.083.51% 
BombyxBGIBMGA001464-TA4e-17877.81% 
DrosophilaCG6404-PA5e-12859.83% 
EBI UniRef50UniRef50_Q9Y1717e-12659.83%BcDNA.GH02220 n=15 Tax=Endopterygota RepID=Q9Y171_DROME
NCBI RefSeqXP_001652200.17e-13464.27%cytochrome oxidase biogenesis protein (oxa1 mitochondrial) [Aedes aegypti]
NCBI nr blastpgi|1571141831e-13264.27%cytochrome oxidase biogenesis protein (oxa1 mitochondrial) [Aedes aegypti]
NCBI nr blastxgi|1571141833e-12864.27%cytochrome oxidase biogenesis protein (oxa1 mitochondrial) [Aedes aegypti]
Group
Gene OntologyGO:00160211.5e-117integral to membrane
GO:00512051.5e-117protein insertion into membrane
KEGG pathwayaag:AaeL_AAEL0067342e-133 
 K03217 (yidC, spoIIIJ, OXA1)maps-> Bacterial secretion system
    Protein export
InterPro domain[29-386] IPR0017081.5e-117Membrane insertion protein, OxaA/YidC
Orthology groupMCL14378 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202142-TA
ATGTTTAAATTACTTAGTCGACACGGCCGTCGAATCGGTATTAGAAATTTATTCTGCGAAAAGCAAATTGAGGTTAAAAAGGCAAGGGTTTATTATTTCTATTCTTCTGCGGGTTCAGTTCGATTTGCTTCCACCGGTGCTGATGTAGGGAAAACAATATCATTAGAATCTATTCCCGAACCCCCTCCCGTTCCTGACAAATCAGTCGTTGGTGATATAACAGAAGCTGTTCAAAGTTTGGCAGCTAATGGAGAACCTTCATTCGCAAGCCTTGGACTTGGCGGTTGGAGCCCAGTTGGTATCGTACAAAACTGTCTCGAATATCTGCATGTGTCCCTGGATGTGCCATGGTGGGGAGCCATATTAATAGGAACTGTTGTTGTAAGAACTCTCATGTTCCCTCTTGTCATTATATCTCAGAGAAACACTGCCAAAATGAATAATAATTTACCTGAGATACAATTACTTCAAATGAAGATGTCACAGGCCAGACAGACTGGAAATCAATTGGAGTCGGCTCGGTACGCTCAAGAGATGATGTTATTTATGAAAGAAAAGGGTTTGAATCCTTTGAGAAACATGATTGTGCCGCTTGCACAAGCGCCATTATTTATTTCATTTTTCATAGGCTTAAGAGGAATGGCAAACTGTCCTGTAGAGAGTATGATGTCTGGTGGTATGTGGTGGTTTACTGATTTGACTGTTCCTGATCAGTTTTTCATATTGCCACTCATAACAAGTGCTACTATGTGGGCTACAATTGAACTAGGAGTTGATGGTGGCAGATTAGAAGCATCAAACATGCAAATGATGAGATATTTCTTGAGAGCAATTCCCGTAATCATGATACCCTTCACAATAAATTTTCCCGGAGCCATTCTAGTGTATTGGTGTTCGACCAACTTCATATCTCTCTGCCAGGTCGCTGTGCTTAAGCTTCCTGGGGTGAGAGAGTATTTCAAGATACCAAAATTGATTAAGCACAATGCTGAAAGCTTACCAATGAAGAAGAAGGGTTTTGTAGAAGGGGCCAAGGAGTCATGGACTAACATGAAAATATCTAGAGAACTTGCGGATAGACAGAGAGTAGATGAGATGATCTTTACCAAGGCTGGGAAAGGATTTCTTGAAAGTCACTATGAGGAAAGTGCTTCAATGTCCTGCGAATGGAGAAGGTTCAGTTGTTACGACCTTAATTTAGGTGGAGGATAG

Protein sequence:

>DPOGS202142-PA
MFKLLSRHGRRIGIRNLFCEKQIEVKKARVYYFYSSAGSVRFASTGADVGKTISLESIPEPPPVPDKSVVGDITEAVQSLAANGEPSFASLGLGGWSPVGIVQNCLEYLHVSLDVPWWGAILIGTVVVRTLMFPLVIISQRNTAKMNNNLPEIQLLQMKMSQARQTGNQLESARYAQEMMLFMKEKGLNPLRNMIVPLAQAPLFISFFIGLRGMANCPVESMMSGGMWWFTDLTVPDQFFILPLITSATMWATIELGVDGGRLEASNMQMMRYFLRAIPVIMIPFTINFPGAILVYWCSTNFISLCQVAVLKLPGVREYFKIPKLIKHNAESLPMKKKGFVEGAKESWTNMKISRELADRQRVDEMIFTKAGKGFLESHYEESASMSCEWRRFSCYDLNLGGG-