Monarch geneset OGS2.0

DPOGS205985
TranscriptDPOGS205985-TA1125 bp
ProteinDPOGS205985-PA374 aa
Genomic positionDPSCF300164 + 169661-170785
RNAseq coverage50x (Rank: top 70%)
Annotation
HeliconiusHMEL0053338e-16777.84% 
BombyxBGIBMGA009413-TA4e-16774.13% 
DrosophilaCG8853-PA6e-5234.94% 
EBI UniRef50UniRef50_E2A0F41e-7442.05%Intraflagellar transport protein 57-like protein n=1 Tax=Camponotus floridanus RepID=E2A0F4_CAMFO
NCBI RefSeqXP_001604536.11e-7441.67%PREDICTED: similar to intraflagellar transport 57 homolog (Chlamydomonas) (predicted) [Nasonia vitripennis]
NCBI nr blastpgi|3838659272e-7543.27%PREDICTED: intraflagellar transport protein 57 homolog [Megachile rotundata]
NCBI nr blastxgi|3071882993e-7942.11%Intraflagellar transport protein 57-like protein [Camponotus floridanus]
Group
KEGG pathwaynvi:1001209443e-74 
 K04638 (ESRRBL1, HIPPI)maps-> Huntington's disease
InterPro domain[4-345] IPR0195309.7e-94Intra-flagellar transport protein 57
Orthology groupMCL13107 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205985-TA
ATGGATCTTAATATAATGGACAAACTGCGTATTCTGAAGATTGACACAGAGTTGCGTCCGCAAATAAAGATGAAAGCCATGAGTAGATATTATTTTGTGGCGCCTACGAATCCCGTAGAACAGTTCTTCGTCTTCGCATCTACAGCTGCGTGGCTTATAAGAAAATCTGGCAAAGACTTTGATCAACCGAACGAGGAAGATGATCCTAATGCAATCATCGCTGCCGTTCTTGATGTTATTCGGGAGAAAGATATAGCTGTGGACTTCTCAGCTCATAAACTCAAGCAAGGTTGCGGAGACCAAGTTTGTTATATCCTTAACGTTTTGGCTGACGAAGCCTTGAAGGTTGAAAATTTCGAATGGCTGAAACCTGTTGTTGATATTAATGAGTCGGAAGAAATTACAGACGACGTTGACCAGGTTGAGGATGAAACTGAAATCATTTTGGATAAAATAGAAGAGGAAATGGCGATTTATTCTGAGGAATCTGAGGGGGAGATTGAGAAGGAGGAGGAAAAAGATATCAACCTGACTACTAGAGTTCACGATTGGGAAGCTTGGAAATTAGAATTGGAGAGGGTTGCACCGGCTTTAAGATTAAAGGTATCCATAGATGGGCGAGATTGGAGAGCGAGGCATGCACAAATGAAGACCTACAGAGATGAACTATCCGAAAGATTCAAAACGACTGGGTCACAGTTAAACAAAGTCTACAGCAACATCACTTCAGTCATGGACAAAATAGGAGCAAGAGAGAATGTACTGAACGAAAGACTCGAACCGTTGGTCAGGGAATACGGATCTCTCTTGGACGAATTGAACAAAGTCACCAATGAGTATAAAGAGGCGAGTGTCGGCGTCACCGAGAGACAGGAAGTACTGAATGAGCTGACATCCAAGGTTGAGAACATGAAGCAAAAGACCGAATCACGCGGTTCGTCCATGAACGATAACTCGCCGTTGGTGACAGCAAAAAAAGCTGTCGACACTTTGAAGAAAGACATCCAAGAGCTCGACTTCCAGATAATGATTCTGCTCTGGCTGTTGATATCCAAGGAAAACCCGAAAAGTGGCAACTACTTGACATACACTGAGACGCGTTTCGTTGGCGCTGAAGTTTATTAG

Protein sequence:

>DPOGS205985-PA
MDLNIMDKLRILKIDTELRPQIKMKAMSRYYFVAPTNPVEQFFVFASTAAWLIRKSGKDFDQPNEEDDPNAIIAAVLDVIREKDIAVDFSAHKLKQGCGDQVCYILNVLADEALKVENFEWLKPVVDINESEEITDDVDQVEDETEIILDKIEEEMAIYSEESEGEIEKEEEKDINLTTRVHDWEAWKLELERVAPALRLKVSIDGRDWRARHAQMKTYRDELSERFKTTGSQLNKVYSNITSVMDKIGARENVLNERLEPLVREYGSLLDELNKVTNEYKEASVGVTERQEVLNELTSKVENMKQKTESRGSSMNDNSPLVTAKKAVDTLKKDIQELDFQIMILLWLLISKENPKSGNYLTYTETRFVGAEVY-