Monarch geneset OGS2.0

DPOGS212498
TranscriptDPOGS212498-TA2049 bp
ProteinDPOGS212498-PA682 aa
Genomic positionDPSCF300222 + 163662-169068
RNAseq coverage155x (Rank: top 53%)
Annotation
HeliconiusHMEL0095062e-12767.84% 
BombyxBGIBMGA010224-TA2e-9658.36% 
DrosophilaCG7183-PA9e-3333.33% 
EBI UniRef50UniRef50_E2BN327e-4342.39%Uncharacterized protein C3orf19-like protein n=4 Tax=Formicidae RepID=E2BN32_HARSA
NCBI RefSeqXP_001603880.12e-3940.06%PREDICTED: similar to GA20163-PA [Nasonia vitripennis]
NCBI nr blastpgi|3071804159e-4339.80%Uncharacterized protein C3orf19-like protein [Camponotus floridanus]
NCBI nr blastxgi|3479655476e-5330.10%AGAP001231-PB [Anopheles gambiae str. PEST]
Group
KEGG pathway 
Orthology groupMCL12523 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212498-TA
ATGAATGGACAAAATTCAAAAGAGATTTTCTTTGATAAATCAATTTTATTAAGTTTGAAGGCAGAATTACTGAAGAAAAAAGAGGAGGCTTTAGAAAAGAAACACTTGCCTCAGCATAATGTTCAGAACTTCAAGCCTTCTATTAGTGAAAAGAAAAACAAATCTGAGAGCAATAAAACCAGTTTAAAAGACAAGTTAAAAGTTATTGACACTGATGAGTTTGAAGCATGCAGGAAGTCAAAAGCTGCCTTAGAGAAGAAAGCTGAATTGTATGAGCATTTAGCAGATAATGTGGGAGATTCAAAGCTTGCTGGTCAGTTTTTGGTAGATTTTAAGGGTAAAAAACAAGAGACCATACAACATCCGAAACAAGATGAAACGGAGGAAAAAGAAGATGTTTTTCATGAAGAAGATGAAAGCGAATGGATCGAATTTACCGACTTCTTGGGGAGGACACGTAAGTGTCTTAAGTCAGATTTGGATTATTACCAGAGACGTGATCAGGAGTTAAAGAAAATAGTCACAAATGAACCCGCAGACGATAAAACTGATACAATGGAGCAGAGTTCGAAAGAAGCAGAGAAGCCATTGTTAGTGCAGAAGACGAATGACTATCTCCAGTCATTGAGAGAGAAATGGGAGCAAAAGGAAAGAGAGCTGTTAGCTAAAGAGAAGGATATCCATTATCAGGACTTACTATTTGATGAGGCAAGAATCCACGGCGTGGGCTACTACTCCTTCAGTACAGATGAGACGGAAAGAAAGAAACAAATGGAGGAATTGATAAAGACGAGGAAAGAAACATTAAAAGCACAGGAAGAGGCCGAGAAGCTTAGGAAAGAGAGAGATGATATGATAGCAGCTAGAGTTGCAGCTGCTAGAGCGAGACAGAGAATGAGGGCGGGGCTACCACCAGAAGATCCTGAAGTGCCTTCTGAGCAGAGTTCGAAAGAAGCAGAGAAGCCATTGTTAGTGCAGAAGACGAATGACTATCTCCAGTCATTGAGAGAGAAATGGGAGCAAAAGGAAAGAGAGCTGTTAGCTAAAGAGAAAGATATCCATTATCAGGACTTACTATTTGATGAGGCAAGAATCCACGGCGTGGGCTACTACTCCTTCAGTACAGATGAGACGGAAAGAAAGAAACAAATGGAGGAATTGATAAAGACGAGGAAGGAAACATTAAAAGCACAGGAAGAGGCCGAGAAGCTTAGGAAAGAGAGAGATGATATGATAGCAGCTAGAGTTGCAGCTGCTAGAGCGAGACAGAGAATGAGGGCGGGGCTACCACCAGAAGATCCTGAAGAAAAGAAAAAAGATTTTACCACATGCCTATTACAATTCCTCACTCAACAAAAGGACGAAGCTGACAAAAAAGCGAAGGAAGAAGAGGAGAAGGCTAAGAAAGAAAGAGAAGAAGAGAGACAGAAGCTTCGTGAAGCTTACATACGAGAATGGGATGTAGGGAAGGATGGACTTCAGGGAAATGTAAAGAAGTTCAGAGAAATGTCCCAAGAAGAGTACGTCGAACAGCAGAGAGCTAAGAGGATAAACGAATTCGCACCACCACAGTCCTCTACGAGAGAAAAATCAATGTATACCTTCAACAAGGACGGCAGAAAAATCGATAGTGATAATAAAACGAAATCCTGGTCCGAGGTCAGACCGATGAATACTCCGCCGCCGCCGAATATATCGGATATAACCGATGATACAAACAAAGGGTTATATTTTACAACCAAGAAACCCGAAACTATAGTTAAATATAAAAATTTCATCAAGGCAATCGAACCTACGGCTATTGTCAATGAATTAAGTGATGATGAAGAGGATGTACAAAGACAGTCTGAAGGAAACGTTAGTTGTAATAAAGCAGAAATATCGCCTCCACCGACATACGAATATTACGGCCCTGAGGCTAAATATAGAAAAGCCGATAAACCTTTCAAATCAGATATACGAGAAGCCATGGAACAAGGCGCGCGAAGTCTGGAGACTAAGGAGAGCAGTAGAAAAATAGGAAAGCAGTACGATTTCACTTTTGATTGA

Protein sequence:

>DPOGS212498-PA
MNGQNSKEIFFDKSILLSLKAELLKKKEEALEKKHLPQHNVQNFKPSISEKKNKSESNKTSLKDKLKVIDTDEFEACRKSKAALEKKAELYEHLADNVGDSKLAGQFLVDFKGKKQETIQHPKQDETEEKEDVFHEEDESEWIEFTDFLGRTRKCLKSDLDYYQRRDQELKKIVTNEPADDKTDTMEQSSKEAEKPLLVQKTNDYLQSLREKWEQKERELLAKEKDIHYQDLLFDEARIHGVGYYSFSTDETERKKQMEELIKTRKETLKAQEEAEKLRKERDDMIAARVAAARARQRMRAGLPPEDPEVPSEQSSKEAEKPLLVQKTNDYLQSLREKWEQKERELLAKEKDIHYQDLLFDEARIHGVGYYSFSTDETERKKQMEELIKTRKETLKAQEEAEKLRKERDDMIAARVAAARARQRMRAGLPPEDPEEKKKDFTTCLLQFLTQQKDEADKKAKEEEEKAKKEREEERQKLREAYIREWDVGKDGLQGNVKKFREMSQEEYVEQQRAKRINEFAPPQSSTREKSMYTFNKDGRKIDSDNKTKSWSEVRPMNTPPPPNISDITDDTNKGLYFTTKKPETIVKYKNFIKAIEPTAIVNELSDDEEDVQRQSEGNVSCNKAEISPPPTYEYYGPEAKYRKADKPFKSDIREAMEQGARSLETKESSRKIGKQYDFTFD-