Monarch geneset OGS2.0

DPOGS208167
TranscriptDPOGS208167-TA1779 bp
ProteinDPOGS208167-PA592 aa
Genomic positionDPSCF300486 + 10642-21557
RNAseq coverage862x (Rank: top 15%)
Annotation
HeliconiusHMEL0158380.081.94% 
BombyxBGIBMGA004539-TA0.070.21% 
DrosophilaCG1021-PH2e-10750.68% 
EBI UniRef50UniRef50_UPI00021A6B5D1e-12753.70%UPI00021A6B5D related cluster n=4 Tax=unknown RepID=UPI00021A6B5D
NCBI RefSeqXP_001603731.13e-12149.24%PREDICTED: similar to CG1021-PA [Nasonia vitripennis]
NCBI nr blastpgi|3072045274e-12752.16%Transmembrane and coiled-coil domains protein 1 [Harpegnathos saltator]
NCBI nr blastxgi|3838568288e-12246.60%PREDICTED: transmembrane and coiled-coil domains protein 1-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[146-561] IPR0193942.3e-116Predicted transmembrane/coiled-coil 2 protein
Orthology groupMCL11634 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208167-TA
ATGGATACATTGCACGTGGGACACGGTCATAGCAAGGGGTCCAGTAGGAGCCAGTCGCCTTCAGCCGCGGACCTCAGGGACCCTGCTGTCGTCACCGATGACGTCATCAGGTCCAGAGTTCAATCGCCATCGGCTGCACATCATGATGGTCGACATGGTGAATCCCGTCCACATCACGTTAAATCCCATTCGAGGGATTTGGGCAGCAATCCGCTATGCAGCAGCACAACAGCAAACACATCCAGAGACTTGGAAGAGTCGCCACAAAAGAAGCCCAGCATACATCCAGACTACACGTTTGCCGTCTGCAGTCCTAACGTGTCGTATGCTCTGTCATCAGCGGATGAGTACGGCTCACCAGGTTACAGAAGTCAAGATGGAGATTATTTGAGCCAAGAGGATATGGATGGTAATTCAACAGCGACGTCGTCCAAGGCGCAGGCTGCCATCGCTCATCTGAACGCTAAGATAGAACGCACCAAGGATCTCATACGACTGGAACAGACCACCAGAGATGGTAGGTGGGATGGCGGTGGTGAGAACGTGAACGAGTACCTGAAGCTAGCAGCGAACGCTGACAAGCAGCAGCTGGCGAGGATCAAAGCTGTGTTCGAGAAGAAGAATCAGAAGAGTGCGCTCTGTATCGTACAGCTGCAGAAGAAGCTGGAGGGGTACAACAAGAGAATTAAGAGCTGGGAGCAACACGGCACTAGCGGCCAGTCGCACAGACAACCGAGGGAAGTGTTACGAGATATGGGACAAGGACTCAAGAACGTCGGCGGTAACATTCGTGACGGCATCACGGGTCTCGGCGGCAGTGTGATGTCGGCACCCAGAGAGTTCGCTCACCTCTTCTCGCGGTCCAACCGATTCGGCTCAGCTGACAACATCGCACATCTCGCCGTATATAGTCCCCGTATTGAGTGTTCCTACAACGCAGCGAACGCGTCGTCCGAAGGCTCCCGTGCGTCGGAGGGCGAAGGCGCCCGGGGCTCTACGCTGCCTCGCGCGGGCGCCGCCCCCTCGCCCGCCCAGAAGTACGTCTCGCCGGAGGCTTCGCCCTCCTCGTCCGTCACCAGCGAGGGAGCCCCTTACCCGACACAAACGAATTCAAACAACATCGAAGCTTCGTTTAGCCTCAAACCCATACTGAACGAGCTGAAGGAGAGAAGAGAAGATTACGAGAGGATCGCTGAGAAGATAGACGCCTTGAAGAACCTCTCCCAGGAAGTGTCATTCCTGTCCGCGGCGCTGCAGGAGGAGCGCTTCAAGGCTGAGAGGCTCGAGGAACAGATCAATGATCTCACTGAACTGCATCAAAACGAGGTTGAAAATTTGAAGCAGGCACTAACAGACGTCGAGGAAAAAGTACAATATCAGAGCGAGGAACGAATACGAGATATACATGAAGCGTTGGACCTGTGTCAGACCAAGGTGAGGAGAACACGCTCAGGTGGTGGGGCCGGCGGGGGTGGGGGGGAGTCTGGTGGTGCCGCGGCGCTACCACCGCGACTACTGCTCGTCAAGTTGCTGAACGTTGCCATCACATTGCTGCAGTTGGCATTATTGCTGGTAGCAACAGTCGCTGGTGTGGCCATGCCGTTCTTGAGAACCAGAGTCCGTGTCCTGACCACCAGTCTTGCTGTGATGTTGGGTGTGATGGTATTAAAACAGTGGCCGGAAGTCACTCAACTGTCAGAACATCTCCTGAAAAGGCTCAAGGAATACTTGTTGGACAAACATCACGACAGGTATGAATGCAAGTCGTTGTGTGCTTGA

Protein sequence:

>DPOGS208167-PA
MDTLHVGHGHSKGSSRSQSPSAADLRDPAVVTDDVIRSRVQSPSAAHHDGRHGESRPHHVKSHSRDLGSNPLCSSTTANTSRDLEESPQKKPSIHPDYTFAVCSPNVSYALSSADEYGSPGYRSQDGDYLSQEDMDGNSTATSSKAQAAIAHLNAKIERTKDLIRLEQTTRDGRWDGGGENVNEYLKLAANADKQQLARIKAVFEKKNQKSALCIVQLQKKLEGYNKRIKSWEQHGTSGQSHRQPREVLRDMGQGLKNVGGNIRDGITGLGGSVMSAPREFAHLFSRSNRFGSADNIAHLAVYSPRIECSYNAANASSEGSRASEGEGARGSTLPRAGAAPSPAQKYVSPEASPSSSVTSEGAPYPTQTNSNNIEASFSLKPILNELKERREDYERIAEKIDALKNLSQEVSFLSAALQEERFKAERLEEQINDLTELHQNEVENLKQALTDVEEKVQYQSEERIRDIHEALDLCQTKVRRTRSGGGAGGGGGESGGAAALPPRLLLVKLLNVAITLLQLALLLVATVAGVAMPFLRTRVRVLTTSLAVMLGVMVLKQWPEVTQLSEHLLKRLKEYLLDKHHDRYECKSLCA-