Monarch geneset OGS2.0

DPOGS200466
TranscriptDPOGS200466-TA1302 bp
ProteinDPOGS200466-PA433 aa
Genomic positionDPSCF300260 + 270554-276435
RNAseq coverage523x (Rank: top 24%)
Annotation
HeliconiusHMEL0127877e-17968.09% 
BombyxBGIBMGA011375-TA1e-16561.24% 
DrosophilaCG7639-PB2e-6433.33% 
EBI UniRef50UniRef50_E2A0E11e-9742.99%Sorting and assembly machinery component 50-like protein n=9 Tax=Formicidae RepID=E2A0E1_CAMFO
NCBI RefSeqXP_001603591.12e-10244.39%PREDICTED: similar to Sorting and assembly machinery component 50 homolog (S. cerevisiae) [Nasonia vitripennis]
NCBI nr blastpgi|3320229733e-10143.76%Sorting and assembly machinery component 50-like protein [Acromyrmex echinatior]
NCBI nr blastxgi|1892383414e-9844.28%PREDICTED: similar to sorting and assembly machinery component 50 homolog [Tribolium castaneum]
Group
Gene OntologyGO:00198679.9e-32outer membrane
KEGG pathway 
InterPro domain[228-432] IPR0001849.9e-32Bacterial surface antigen (D15)
Orthology groupMCL14026 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200466-TA
ATGGGCACTGTACATGCTAAGGCAGATAACAGTCAGGTGGAAGGGGGTTTGTTTGATTTGGACGAGGCGAACACGATGGATCCTATATCGAAACCATCAATTAAATTAAACGGAGTCAGGGCAAGAGTGGACAGAGTTCATGTGGATGGCCTGAGTCGGACTAAAGACGACATTATCAGAAGCACCGTTGACGACCTGTTCTATGCTACTGACTTCGAAGATGTCATACTACGAGCTCATAAAGTACGTAGGGCTTTGGACTCTATGGGCTGCTTCAAGGACATTGGAGTTTATATAGATGTGTCCAGTGGACCGGGAGCCACGCCTGAAGGGCTAGAGGTGACATTTCAAGTTAAGGAGCTGTCCCGCGTGCTGGGCGGCGTGAACACGACGGTTAGCGAAAACGAGGGGAATCTCGTAATAGGTGTGAAGCTGCCCAACGTGTTCGGTCGCGGGGAGCGCGCCGCTGCCGAGTACAGCGTGGGTCACAGGAGCTCCTCCAACTTCAACATGTCCGCTACGAAGCCCTACCCTCACGCACCTCTCACACCGGTGAGGGGAACTACAAAAAAGCTAGCAGGAGATTATATGAGACTCCATGCTCCACCTCTGTCTCTTCGTACTAAAACTCGTTGTTCTATCAATCTCATAGTAATGGTCAGTCCCCAGGTGAAGTCCGTGCTCCGTCACATAGTGAGCGTGGATCACCGTGACGAGAGCGTGTTCCCTACGCGCGGCACCTGGGCGCAGTTCACTAGCGAGCTGGCCGGGCTGGGGGGTGGAGTAGCCAACATCAGGACAGAGCTACAGGCGCAGGGGAATAAGGAGATATATCCTGGAGTTGTGTTCCAATTGAGTGGAGCCCTAGGGGTGCTCCACGACGTGTACGGGACCGACATCCCTGAGCACTTCTACCTCGGAGGACCTTCGACCATCCGGGGCTTCGGTCAACGGGGCGTGGGCCCTCACGTTGAGAACCACTCCTTAGGAGGAACCGTTTATTGGGCGTCTGGGGTCCATCTATACACCCCCCTCCCCTTCCAACCCGGCAAGGGGGGGCTTGGTGAATTGTTCAGATCACATTTCTTCATCAACGGCGGATGTTTGGCTCATCCCGAAGAATCTGGTCTGGCTATATTGGATGCCCTGACGGAGGCTCGGCTGGCGTGCGGAGCGGGGGTCGCGCTACGACTCGGAGGAGTAGCGAGGGTCGAACTCAACTACTGCGTGCCCTTGAAAGCTCGCCAGGGTGACGTCACCGCTCAAGGGATACAGTTTGGGATAGCAGCTCACTTCCTATAG

Protein sequence:

>DPOGS200466-PA
MGTVHAKADNSQVEGGLFDLDEANTMDPISKPSIKLNGVRARVDRVHVDGLSRTKDDIIRSTVDDLFYATDFEDVILRAHKVRRALDSMGCFKDIGVYIDVSSGPGATPEGLEVTFQVKELSRVLGGVNTTVSENEGNLVIGVKLPNVFGRGERAAAEYSVGHRSSSNFNMSATKPYPHAPLTPVRGTTKKLAGDYMRLHAPPLSLRTKTRCSINLIVMVSPQVKSVLRHIVSVDHRDESVFPTRGTWAQFTSELAGLGGGVANIRTELQAQGNKEIYPGVVFQLSGALGVLHDVYGTDIPEHFYLGGPSTIRGFGQRGVGPHVENHSLGGTVYWASGVHLYTPLPFQPGKGGLGELFRSHFFINGGCLAHPEESGLAILDALTEARLACGAGVALRLGGVARVELNYCVPLKARQGDVTAQGIQFGIAAHFL-