Monarch geneset OGS2.0

DPOGS206004
Transcript: DPOGS206004-TA, 2853 bp
Protein: DPOGS206004-PA, 950 aa
Genomic position: DPSCF300253 - 152313-158204
RNAseq coverage: 115x (Rank: top 58%)
Annotation
Heliconius: HMEL014612; 0.0; 53.78%
Bombyx: BGIBMGA012663-TA; 6e-99; 52.14%
Drosophila: CG3520-PA; 2e-58; 29.23%
EBI UniRef50: UniRef50_E2AZD0; 1e-73; 25.05%; Uncharacterized protein KIAA1797 n=7 Tax=Formicidae RepID=E2AZD0_CAMFO
NCBI RefSeq: XP_001122559.1; 5e-69; 26.03%; PREDICTED: similar to CG3520-PA [Apis mellifera]
NCBI nr blastp: gi|383863595; 2e-89; 27.09%; PREDICTED: uncharacterized protein KIAA1797-like [Megachile rotundata]
NCBI nr blastx: gi|383863595; 8e-88; 27.20%; PREDICTED: uncharacterized protein KIAA1797-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain: [300-455] IPR022542; 9.8e-18; Domain of unknown function DUF3730
Orthology group: MCL15217; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206004-TA
ATGATACACGGTATGGACCCTCGACCTTGTTATTCTCTCGTGGAGCGTGTGCTGAGCGAGGGCGGGCGGTCGGCGGTCGCGGGGCTTGTGATCATGATGCTGGGCGGAAATCTTGTTCACACATCCGCCCTGTATCTACATGAGTTGTTCCATTTATGCTTGAACATAATCACCAAATACGAGTTCTCGACTCTGTGTCTGAACTCATTCGTAGCTCTGTCACTGCAGTGGCTCCACTTACCGTCGTATTTGACCAACAACGCTCTCAAAGTGTCATCGAAAATATTGGAAATCCATCAAAACATGAATCGTCCGGACTCGGGGCTGTTCATGGCGAATATGAAAAACAACGCTGTCTTCCAAGAGATGGTCCACGTGGACCGGAAGCTGTACATACATTACAAACTGCTGGACACCTGGGAACGACTGAGAGACGAGCCTGACAGGCTGAGCAAGTGGTTCGACGGTCTGATGCGGTGCGATGACCGCCTCAAGGTCGAGCTGATGCCCTTCATCGCCGGCCTGGCTCTGGACGGGATGGGAGACGAGCGCCTGGTGCTGGCGGCGCTCCAGGGCCTCATACAACTCGTGGGCTTCAAGAAGGAAGTGTCCGTCACACTACTGCCCATACTGTTGTACAAAATAGCCAACGACCCCCGACCCAAGGTCAAGCTGGAGTGCCTGAGAGGAGAACGTCCCAGCGCTGGTGTCGATCTTCAACAAGCTGAAGAACAGGAAAGGAGTCCCGACCTCGCAGCTCATCATGATGTACACGTCGGACCGGCTCTGGGTCCACGGGGAGACATATATAGACATCCGAGGGGAAAACCGGGCTTTGTAGACCTTTCTCTTATTTTTGATTCTGTCGAATGTTACCCTTGTGAATGTCATGAAATTAGTAACTCAGTGCGCTGCTTCCCTTACTTACAAGAGTTGTTGTCGGACTCGTCGCTCCACCCGCACGACCTCAAGTGGGAGGTGGACATCGCCAAGGCGCTGGCCGTGCGGCGGATCTGTGAGATCCGGCCGTCCAGTCACGGTCTGGAGTTAGTTCCCGTGGTGTCGTTGCTCCTGAACCGCTGGGAGCGTTCCAGCGCGGGGCCGGTGTCGCTCGCGTTGGAGGCGCTCCGGCACCTGTGGCAAGGGGCGGCGGTGGCGCCCCCCGGCACGTGGCGCGCGCTCCAGCCGCGACTGGCCAAGGATAACAGGATACAAGTTCAGATCAGCCTGTGTAATCTGCTGGCGGAGGCGCCCGGCCTGCGCGTGTCGTCCGCGGAGTACGGCGAGCTGCTGCAGCAGACGGCCGCTCGTCTTTGGCTCTTCATATCCGACTCGGACCAGCCGGCCGTGATCGGGGCGGCGTGTCGCGCGCTCGCCGGCTATAAGATAGAAGATTACACGCTCAAGGACATCCCCGAGGTTTATCGGCGCACGGTGAAGCTGCCTCCGTCGTATTGCAAGACGCCTTCAGACGCCGCGAGGAAGCCCGAAGACGTCCTGGACTATGTGCCATGTGAAGTCTGGCCGGAGGTGTTTAAATACACGAACCAATCGGCTCTAGACTGCGTACAACACCTGGTGTCCAAGTTGATAGGGCGGGAGATTCGAGGGTACCGCAGCGGGGTCTACCACGAGCGGGAGGGGGGCAAGGAGGGGGCGGGCCTCAGTGTCAGCAGCGTATTGAGAGGGGTCGTGGAGGGACTCAGGAAACAGATGGTGAGCCCGACGTACGATTACTCGGACGCCGTCCTCCTGGCGATGTTGGAGACCTTGTCCTCGGAGTTCCCTAAGCCCCTGCCGCCCTTCGACCTCACATTCCTTCACGAGGGTCTGCACCGCGGGGCTCCGATGCGGGCTCGCGTCGTCAAATTGGCCGCCCGTCAAGCCAGTACCGCAGTGTCAGCCAAAAGACTGATTGAAAACTTCCTATCTGCAATCGACCCTGGGAATTGTGAGGAATCAGACATTTTATTGTTCTTCGAATATCTTCCTATCTTGTG
TCGCACGATGCCCCCGAACCATCTCCGGCCTCCGCTCGAGAGATGTCTGAGTGACTCTTTCTCGAGGGTCAGGGTCAAAGGTCAGGAGGAGACGTTCATAAAGCAGTTGAACTACATCAAGGAGTGCCTCGACTGCGACAAGATCCACGATGCCAACAGGACCCTGCTGTCACAGCTGGTCGAGAGCTACTTCACTGTTATAGATGAGGACCACGTGGCGTGGTCCGCGTACCTGGCGGCGTGCTCGTCGCTGGTGGTGAGCTCGGTGGAGCGCATGTCGTCTCCGAGCTCGTGGTGGGAGGTGTCGGGCGCGCTGCTGAGGAAGGCCAGCGTGCTCCGCGCGAGGTTGGCCGCCAACAGACTCGCCTGGATCAACGAGATCGTGGACACCGCCGCGGGCCACGTCACTGAGCAGGAGTTCACGTTGCGATGTTTCCTACCCGCGCTACAGGCTACGGACGTCGACGCGACTAACACCCGCGAGTGGTTCCTGCAATTGATGGCTCGCACTCAGGTCGCCTTCAAAGAGACGGAGGAGGAGTCGGCCAGGTTATACCTGTGCGACGTGTTCTTCCTGAGCGTGGTAGTGTTCAGTGGTCTATGGACCCTGGAGGCGGACGGCGAGGCGCTGGTCGCCGACAGGGACGCCAGGCTGGGGCTCGCTCCCGCCGCCCTCGGCCTGCTCGTGGACAGGGACGGCTGGACGGACTACACCGCACAGTTGTTGGAGTGGTTGTGCCACACGCGCTCCGTGACCCGCCACGCCGGCGTGTCCCGCTGCTGCAGGCGCTCCGTGCTCGCCCTGAGGCACACGAGAGCCTTCCACGAGCACGCGGTCTTGATGAAGCTGTGA

Protein sequence:

>DPOGS206004-PA
MIHGMDPRPCYSLVERVLSEGGRSAVAGLVIMMLGGNLVHTSALYLHELFHLCLNIITKYEFSTLCLNSFVALSLQWLHLPSYLTNNALKVSSKILEIHQNMNRPDSGLFMANMKNNAVFQEMVHVDRKLYIHYKLLDTWERLRDEPDRLSKWFDGLMRCDDRLKVELMPFIAGLALDGMGDERLVLAALQGLIQLVGFKKEVSVTLLPILLYKIANDPRPKVKLECLRGERPSAGVDLQQAEEQERSPDLAAHHDVHVGPALGPRGDIYRHPRGKPGFVDLSLIFDSVECYPCECHEISNSVRCFPYLQELLSDSSLHPHDLKWEVDIAKALAVRRICEIRPSSHGLELVPVVSLLLNRWERSSAGPVSLALEALRHLWQGAAVAPPGTWRALQPRLAKDNRIQVQISLCNLLAEAPGLRVSSAEYGELLQQTAARLWLFISDSDQPAVIGAACRALAGYKIEDYTLKDIPEVYRRTVKLPPSYCKTPSDAARKPEDVLDYVPCEVWPEVFKYTNQSALDCVQHLVSKLIGREIRGYRSGVYHEREGGKEGAGLSVSSVLRGVVEGLRKQMVSPTYDYSDAVLLAMLETLSSEFPKPLPPFDLTFLHEGLHRGAPMRARVVKLAARQASTAVSAKRLIENFLSAIDPGNCEESDILLFFEYLPILCRTMPPNHLRPPLERCLSDSFSRVRVKGQEETFIKQLNYIKECLDCDKIHDANRTLLSQLVESYFTVIDEDHVAWSAYLAACSSLVVSSVERMSSPSSWWEVSGALLRKASVLRARLAANRLAWINEIVDTAAGHVTEQEFTLRCFLPALQATDVDATNTREWFLQLMARTQVAFKETEEESARLYLCDVFFLSVVVFSGLWTLEADGEALVADRDARLGLAPAALGLLVDRDGWTDYTAQLLEWLCHTRSVTRHAGVSRCCRRSVLALRHTRAFHEHAVLMKL-
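
The lengths listed in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration and is not part of the database record: the function names `cds_length_for` and `inclusive_span` are hypothetical, and it assumes that the transcript consists of the coding sequence plus one stop codon and that the genomic coordinates are 1-based and inclusive.

```python
def cds_length_for(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode `protein_aa` residues.

    Assumption: three nucleotides per residue, plus one stop codon
    when `include_stop` is True.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def inclusive_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 2853
protein_aa = 950
scaffold_start, scaffold_end = 152313, 158204

# The transcript length matches 950 codons plus a stop codon.
print(cds_length_for(protein_aa) == transcript_bp)

# Length of the region the gene occupies on scaffold DPSCF300253.
print(inclusive_span(scaffold_start, scaffold_end))
```

Under these assumptions the transcript length agrees with the protein length, and the genomic interval is longer than the transcript.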