Monarch geneset OGS2.0

DPOGS202221
TranscriptDPOGS202221-TA1320 bp
ProteinDPOGS202221-PA439 aa
Genomic positionDPSCF300149 + 278960-280617
RNAseq coverage234x (Rank: top 43%)
Annotation
HeliconiusHMEL0092014e-4841.94% 
BombyxBGIBMGA013496-TA6e-8941.63% 
Drosophilauri-PA1e-1430.18% 
EBI UniRef50UniRef50_B0W3P54e-2225.31%Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0W3P5_CULQU
NCBI RefSeqXP_001843329.17e-2325.31%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3838539826e-2235.11%PREDICTED: uncharacterized protein LOC100876942 [Megachile rotundata]
NCBI nr blastxgi|1700309081e-3525.47%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00162725.8e-16prefoldin complex
GO:00064575.8e-16protein folding
GO:00510825.8e-16unfolded protein binding
KEGG pathway 
InterPro domain[10-113] IPR0041275.8e-16Prefoldin subunit
[1-112] IPR0090531.5e-11Prefoldin
Orthology groupMCL26082 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202221-TA
ATGAATTTCCTTAGTGATATATATCAGAAAAGTTTACATGAAAACGAAAAAAATATTAAATTCTGGGAAGAATATTTACAAAATCTAAACACTCTCAACTTTGAAATCTATTCTGATAAGTTGTCAGTACCAATATTAGTACCTATTGGAAATAAAATATTATTTAGGGGAGCATTAAAACACACAAATGAAGTGACCGTAGCCTTAGGCGCGGATTATTTTGCTAAATGTTCTATTAAACAGGCTGAGGTACTGAGGCAGCACAGAATTAAAGATGCAAAATCAAAACTTGAAGAATATAATAAAGAGAAAGAATATCTTGAAAATCAACTATCTTTTGGAAAACAAAATGTATTTGGTAATACTGATCAGGACATTATTGAAGTTTGTACTGAAGAGGAAGATAAAACGTGGAGGGAGCAGCACAGAGAGAGACTGAAACGATATCATCAGAGTGACGATAAGAAGAAAGAAAACATAAGTAAAGACATCTCCGACGAAGAACTGTGGATGAGGTTGGAGGAGTTGGAGTTACAGGAGGAAATGCAAAATGAAATGCAAAACTCAGATACTGTAGAGGAAACCAATAGTTATGAATCAAACTTTGATTCTTGTGTCATCAAAGAATCTGATGTTAAAGATGTCAGAGCTGATGAAGCTACAGAGTTAGCCTCAAGAATAACTCACAATGTTCCAAAGCAAACTTCAAAGACAGATTTATTACAGCAAGTTCTCGACAGACAGGAAATGCTTTCGACTAAACTGACCGAACTCAAGAGTAGGGATCGGCCGGAAACTACTACGGAGAGTGAATTGTTGTCCAGATTAGATGAAATTGAACTCCTGGATGACCTTGAGGATGAAATGGATAGAATAGATGATATATTAGAAACTGCTGAAGATGAAGATTCATCAAGAAGTGACACTTCCAAGTCGAGTAAAAGTGTGTCCTTCACTGACGAAGATGATGGAAAGACATTGGAATTAACGTTCACACATACGGATGTGGAACCAGATATGACACCGTATGATCCAGAAAAGGGCATCATGAAGCCGAGGGATATTTATGTAGCGTGTGCAAACCTATTTAATAATGGAACAACATCCATATTAAGAAAATCTAAATATTTGGATAAATCAGCCAACGTAACAAAGGAAATGGAAGCGCCGCAAGCAGTGAATAAGAATGGTATTACGGATACGGAGAGACAAGAAATAGTTGTGAGGGATGTTGTTGAAAAATCGGCATCTCAGGAAAATCTAACAGCCAGTGCGAGACCCACCAGCCTCTTCAAACAGAAGCGACAGCAGAAGTCTTAA

Protein sequence:

>DPOGS202221-PA
MNFLSDIYQKSLHENEKNIKFWEEYLQNLNTLNFEIYSDKLSVPILVPIGNKILFRGALKHTNEVTVALGADYFAKCSIKQAEVLRQHRIKDAKSKLEEYNKEKEYLENQLSFGKQNVFGNTDQDIIEVCTEEEDKTWREQHRERLKRYHQSDDKKKENISKDISDEELWMRLEELELQEEMQNEMQNSDTVEETNSYESNFDSCVIKESDVKDVRADEATELASRITHNVPKQTSKTDLLQQVLDRQEMLSTKLTELKSRDRPETTTESELLSRLDEIELLDDLEDEMDRIDDILETAEDEDSSRSDTSKSSKSVSFTDEDDGKTLELTFTHTDVEPDMTPYDPEKGIMKPRDIYVACANLFNNGTTSILRKSKYLDKSANVTKEMEAPQAVNKNGITDTERQEIVVRDVVEKSASQENLTASARPTSLFKQKRQQKS-