Monarch geneset OGS2.0

DPOGS200651
TranscriptDPOGS200651-TA4764 bp
ProteinDPOGS200651-PA1587 aa
Genomic positionDPSCF300076 + 754851-764456
RNAseq coverage186x (Rank: top 49%)
Annotation
HeliconiusHMEL0038570.081.50% 
BombyxBGIBMGA011326-TA0.075.76% 
DrosophilaCG42788-PB0.045.11% 
EBI UniRef50UniRef50_A0NDB90.053.28%AGAP003128-PA n=3 Tax=Culicidae RepID=A0NDB9_ANOGA
NCBI RefSeqXP_001359323.20.044.85%GA16165 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|3479693020.053.28%AGAP003128-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1984537510.044.88%GA16165 [Drosophila pseudoobscura pseudoobscura]
Group
Gene OntologyGO:00055152e-15protein binding
KEGG pathwaydre:4031292e-40 
 K05404 (TLR7)maps-> Toll-like receptor signaling pathway
InterPro domain[209-434] IPR0197491.6e-33Band 4.1 domain
[301-429] IPR0197483e-30FERM central domain
[39-159] IPR0014782e-15PDZ/DHR/GLGF
[31-65] IPR0012023.6e-11WW/Rsp5/WWP
Orthology groupMCL17410 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200651-TA
ATGGATGCGAACCCAACTCAAAGTCACGTCACGAGGACCGCGAGCTGGTTACCGCCGGCGGAGAGTTGGCCCGTCAAGGATGCAGAGAGTCCGGACGAAGAGGATCTGCCCTACGGCTGGGAACAGGCGCTCGATAACAGAGGGAAACCGTACTACATCAATCACCTTAATAAAACAACCACATACGTGGCACCGGAAGGCTGTTCCTGCGAGTCACCACCAGCTCCTCGGGATGTGGTTCTCGATAGAGATCCAGAGCTCGGCTTCGGTTTCGTCGCCGGTTCAGAAAAACCTGTTCTAGTTCGTTTTGTCACCGACAACTCACCGAGTGTCGGAAAGTTGGAACCTGGTGATCAAATCCTTTGTGTAAATGGCGAGGACGTCAGCACAGCAGCTCGTGAACACGTCATCTCAGCAGTACGAGCCTGTACGAACCGTGTCACCCTGAAAGTGTGTCAGCCAGCTCGTAGCGGTGACCCGGCAAGGAAGTCTGCCTTCCTCTCTGCCGCTAAACGAGCCAGACTACGAGCAAGGCCGCCCAGGGTCAGGTTTGCAGATTCGGTTCAACTAAATGGAGCACCTATGTATCCACCATCAGCGTTCTCTCTTGGCGACCTCTGCCTTCCTCCTATGGCTAATGTCCTAAAGGTGTTTTTGGAAAACGGTCAAACGAAATCGTTTAAGTATGATACCACCACCACCGTGGCTGACGTGGTGACCAGCTTGAAGGACAAGCTCTGTATAACAGCCGAAGAGCATTTTAGCTTGGTCGTTGAACATGTGAAGAGCTTGAAGAGGAATAAGCTCACATTGTTGGATCCTAAAGAGTCTTTGGCCAGGATCGCAGCTCGTCCCGGTTCTCATAAGATGCGATGCCTGTACCGCGTGGTGTTCATGCCGTCGTCCGCTGCGGAGTTAGCGCAACGCGACCTGGCAGCCCTGGACTACCTCTACACTCAGTGCTGTAACGACGTGGCCCAGGATCGGTTCGCCCCGGAACTGCAGCCCGACGTCGCGCTGCGACTCGCTGCACTCCACATACACCAACACGCGCTCGCTCACAACATGTCCCCCGCTAAACTGACTGTCAAAGTTGTCGAGCGAGAGTTTGGTCTGGAGCGTTTTGTTTCCAGCAGTCTCCTGGAGAGCATGAAGCGGAAGGAACTCCGACGACTGGTGGCGCACTTCCTCAAACTGAACGCTCAGATGACGGGAGGGCAACGGCATCTCACACAGTTACAGGCTAAGTTGCATTACCTGGACACGGTCAGCGCTCTGCCCAGCTACGGCGCGAAGTGTTTCAGCAGCGCGCGGCCTGAGAGAGTGTTGCTCGTGTCCCCTCGTTTTGGACTGTCACAGATCGTCGGGAGAAGCAACTCCGTGCCTCAAGCCATAGCTTCTATAGAGGAGGTGGAGTGTGTTCGTGTCCGCGGGTCCTGTGTGTCGGGGGGCGGAGCGGACGTGGGGGTATTCCTGACCCGGGATCGCGTGCTCGCTTTGCTCATGGATGACCGCGACGCCGCCGAGATGCCACTGGTCATCGCCGGTTACTACCGCCTCGCTACGGGTCGGGAACTGAATATAGAATTGGAGAGAGAACCCATAACCGAGGATATAGCACCGCCGTATCTATCACAGCACAACGTGGTCCCGACCAAATGGAGCTATTTACACTATAACGATCCTATAATGCTAACGAAGAAACATTATGCGATCTTCAGTATGCCGCCGCCTTATCACTCCACACCGGATCAGGCCAAGGCCAGCTTAGACACCAACATGAACACGAGCATTGCGAACAGGAATCACAGCAATAACTCCAGTCCTTTAGTCGGCTACGATTCCAAATCCGGTCTGCTCGGATACGATATACATTTTGAGAAGTGTAAAAGAAATGAGGATTTCCGTCTTTTCGATCCGAACGACGCTTGCTACCTCGACGACGGCATGGGATTTGATCTACAAAGTGTTCTGTCGATGGAATTGTTGGAGAACGCAAACAACCCCCGATTAGTGGAGGCCAAGAATGAGGAGGTGTTGCGAAGGGTGGCGGAAATGCAAAAACTAGTAGAGAATTCTGAACAGTATTTGACTGAAAATTGTGATGTTTTTAATAATAGCGGTTATAACGGTGTTCTTAGAGACGAATTAGTGGGTAAAGAGACTTGCGTAGATAATATTGAAAGTGATACGGAGTCGAACAATAGTAAGATGTCCGCTGCGGATGACGCGCCCGGAGTTCTTAAGCATAGTGACTCCTTGCTCCTGTTGACTGAAACCATTAACCAAGGACTGAATAGCGGTAATGTAGAAACGCAAAAACAGTATACTTCTACAGATCTAACTCAAAGACAATCTCAAGGACTTAGCGCAATTCTCAATAATTGTGGATTCACAGACGCTTTGTTAGCTCTTAATAATGACGTTGGCTTCTCTGAGAGTGACAACGATTCAACGTATACTCCTACGAGTAGTCCAATACGGAAACATCAAAATAAATCGACCAATAAAGCAGTTAGGACTAGTTTTGGTCTGCATAGCCCAGACGCAACATTAGACACTAAAGAATCAAATTTAAAACAATACTTAAAGCAGCTCAGAGAAAAGAGCGACAGGGAGGAAAGTGCAGCTGAATTTCCGTATTTTTACCAAGAGAGCTTAATAGAAAACGACGCCGAGCTTATAGACTTAACATTAATACCCCCACCGCAAACGCCTGATGAATTAGACTGCGCAACGCAAGTGCCTCAAATTTTACCTATCGTACCTCCTTCCTTTGCCGATGATAAGGACACAGAGCAAGACAATAGCCCTAAGAGTAAAGAGCTAGAGGAATTCATCGCTAATGTTACGGTACAACCCCCGACAGTTAAAATAACTCCAGCAATAGAGCTCACGCCGGAGGAAATAATGTCATATATCATACCACCGCCGCCGGGTTCAAACGCGTCTACTATAGATAGAGATTCAATTAATACCGTAAGCCAAGAAAAAAGCGGTAACAGCAACGAAGAACATAATAATACTAATGAGAAAATATCTAACGGTGAGGTAGTTAAAGGCGTCTTGAGTGTGAGCGAGATCAGAAATATGTTTGCTTCAAAACAATTAAGTGACGCTAGAAACAGTTTATTATTAAAAGAAAAAAATAAGGCCGACTCTCCGAAGAGCAAGCGAAAAAGTATAGCCGGAGATAAACAATCCAACGGCAACGCACATGTTATAGAATACCCCACTGTTGAGAGAAAAGGTATGTTCTCATGTTGCAGCAAAGACAAAAATAAGAACGATGATCTTTGTGAGGAAACAGAGCAAATTGAAGGTCATATAACGAACTCAGATTCGATGCCTGATGTTTGTGAAATCCGACCACCTCCGAGAAGAAAAGGCAGTGAAATTCGGAAACCTCCAGAACGCCCTCCGAAAATGCCTCCCACGCCGTCTCCGAGACCGAGATCGAATTCATTCACGTGCCCCAACAATGAACTCAGCGCTGTAGAGTTCACAAGCACTCATTTCAGTACGTTAAATAATAAAAAGTTAAATTATAGAGTGGTCTGTGAAATGAATGAAGGGATACATACGCCTCCCCAAATCCCACCACGGATAGAAAATAGAACACTTTCTCCTCCCCCACCATTATTACTGCCACCTAAGAAACCTCCCCTACCCCCCGTTCCAAGTATAGAAGTTTTAAGAATTAAAAATTCCCAGAAACTATCACCCGTTCGGGCGACCGAGCGGCTGGCTAGTATCGGGTCACCACATTTTCACAGGAATCTTAACACTTACCGGAACCTCGAAAACGAAAACAAACTAACCGATGACGAATCCCACAATCCGAGATCTCAGAATAACAACTCTATGACGCTACAAAATCGCCACGTACGTTCAAATTCTGAGACGAAGAGTACCATATTGAAAAATAAGAACATATCAACAGGCCTCTCTCTGTGTTCGCCTCAAATGAACCGTCGGTTCAGCAACAGACCTGACCTGTTAAACGAAGCTCCCGTCGGCGATCAGAGACTATTCAGTCCTTCACTCGCTGAACGGAAGACCCTCAGACGGGACAGCGCGAGTCCGACGGGCAAGGTTCAGAATCATGTGAGGTTTAAGGAAGACGTCGTAGACGACATCCCGACTCCACCTTCTCCACCTACAGTCAAGTTCCCAGAGTTTCAATCTGGTAACAACGGACATGTGTCCATAGAGAACTTATTGTGCAAAACTGAAGTGGCCGTTGACGGGTTACTGCAGAGATTACACCAGGTGGCGCAGAAATGTTCACACCAACACGCCCACGGAGGAGGCGAAGACATCGATGAAATTAAATTCCAGCGAGCTCGCTCAGAACTAACTTCGTGCGCGGTGTCTTTGGTGGGAGCGTCCCGTACACTAGTGGGGGCTTTGGGAACCTCTCAGGAAGGTCCCCCAGGGCTGGCGGCTCCCGCCGCCCTACCTGACTGCCTTACACCGTTAAGACGCCTAACGGATTTGGCACAGGCTTTGGGGAAGCACACTTCAGCACCGCTCCAAACTAGAAATTTGATCCTTCGAGTTCATGACGTCACCGCCGCCTTTAAAGACTTAGCAGGTGCGGAAATGGCGCAGATTATACACGAACATAACTCCAAAGATGGAACTCAAGGCCAGGGTCAGGACTCGGCTTCCACCCTCGAGGGTCAACTGGCCCTCCGAGCAGAATGCCTGGCGAATGTCTTGGCGACTCTACTGCGGAGTCTTAGGGTATTCTCACCTTAA

Protein sequence:

>DPOGS200651-PA
MDANPTQSHVTRTASWLPPAESWPVKDAESPDEEDLPYGWEQALDNRGKPYYINHLNKTTTYVAPEGCSCESPPAPRDVVLDRDPELGFGFVAGSEKPVLVRFVTDNSPSVGKLEPGDQILCVNGEDVSTAAREHVISAVRACTNRVTLKVCQPARSGDPARKSAFLSAAKRARLRARPPRVRFADSVQLNGAPMYPPSAFSLGDLCLPPMANVLKVFLENGQTKSFKYDTTTTVADVVTSLKDKLCITAEEHFSLVVEHVKSLKRNKLTLLDPKESLARIAARPGSHKMRCLYRVVFMPSSAAELAQRDLAALDYLYTQCCNDVAQDRFAPELQPDVALRLAALHIHQHALAHNMSPAKLTVKVVEREFGLERFVSSSLLESMKRKELRRLVAHFLKLNAQMTGGQRHLTQLQAKLHYLDTVSALPSYGAKCFSSARPERVLLVSPRFGLSQIVGRSNSVPQAIASIEEVECVRVRGSCVSGGGADVGVFLTRDRVLALLMDDRDAAEMPLVIAGYYRLATGRELNIELEREPITEDIAPPYLSQHNVVPTKWSYLHYNDPIMLTKKHYAIFSMPPPYHSTPDQAKASLDTNMNTSIANRNHSNNSSPLVGYDSKSGLLGYDIHFEKCKRNEDFRLFDPNDACYLDDGMGFDLQSVLSMELLENANNPRLVEAKNEEVLRRVAEMQKLVENSEQYLTENCDVFNNSGYNGVLRDELVGKETCVDNIESDTESNNSKMSAADDAPGVLKHSDSLLLLTETINQGLNSGNVETQKQYTSTDLTQRQSQGLSAILNNCGFTDALLALNNDVGFSESDNDSTYTPTSSPIRKHQNKSTNKAVRTSFGLHSPDATLDTKESNLKQYLKQLREKSDREESAAEFPYFYQESLIENDAELIDLTLIPPPQTPDELDCATQVPQILPIVPPSFADDKDTEQDNSPKSKELEEFIANVTVQPPTVKITPAIELTPEEIMSYIIPPPPGSNASTIDRDSINTVSQEKSGNSNEEHNNTNEKISNGEVVKGVLSVSEIRNMFASKQLSDARNSLLLKEKNKADSPKSKRKSIAGDKQSNGNAHVIEYPTVERKGMFSCCSKDKNKNDDLCEETEQIEGHITNSDSMPDVCEIRPPPRRKGSEIRKPPERPPKMPPTPSPRPRSNSFTCPNNELSAVEFTSTHFSTLNNKKLNYRVVCEMNEGIHTPPQIPPRIENRTLSPPPPLLLPPKKPPLPPVPSIEVLRIKNSQKLSPVRATERLASIGSPHFHRNLNTYRNLENENKLTDDESHNPRSQNNNSMTLQNRHVRSNSETKSTILKNKNISTGLSLCSPQMNRRFSNRPDLLNEAPVGDQRLFSPSLAERKTLRRDSASPTGKVQNHVRFKEDVVDDIPTPPSPPTVKFPEFQSGNNGHVSIENLLCKTEVAVDGLLQRLHQVAQKCSHQHAHGGGEDIDEIKFQRARSELTSCAVSLVGASRTLVGALGTSQEGPPGLAAPAALPDCLTPLRRLTDLAQALGKHTSAPLQTRNLILRVHDVTAAFKDLAGAEMAQIIHEHNSKDGTQGQGQDSASTLEGQLALRAECLANVLATLLRSLRVFSP-