Monarch geneset OGS2.0

DPOGS208892
TranscriptDPOGS208892-TA2637 bp
ProteinDPOGS208892-PA878 aa
Genomic positionDPSCF300009 - 914386-917022
RNAseq coverage211x (Rank: top 46%)
Annotation
HeliconiusHMEL0157850.070.06% 
BombyxBGIBMGA006572-TA1e-3824.86% 
DrosophilaPtr-PA4e-4924.38% 
EBI UniRef50UniRef50_Q7PXL50.045.62%AGAP001468-PA n=2 Tax=Anopheles RepID=Q7PXL5_ANOGA
NCBI RefSeqXP_001862575.10.045.67%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3479659770.045.62%AGAP001468-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479659770.045.62%AGAP001468-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160208.7e-78membrane
GO:00081588.7e-78hedgehog receptor activity
KEGG pathwaytps:THAPSDRAFT_2682702e-58 
 K12385 (NPC1)maps-> Lysosome
InterPro domain[223-843] IPR0033928.7e-78Patched
Orthology groupMCL17514 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208892-TA
ATGAAATTATGGGTGCCTCCAGAATCAGACTTTTATTATGACACAAATTGGTACATAGATAATTTTGGAACTAGTTTAAGAATGCAGAAATTGTTAATTACTGCAGACAATGTACTAGATCCACATGTTATACATTTGTTAAGCAATATCACAAATGAAGTCTCATCCATTCAAATTCATTATAATAATAGAACATATTCTAAAAATGACTTGTGTTTTAAAGTACCTGTAGTTGCCTTTGTTAGTCCAAATTGGAAAGCTAGATCTGAGGTGCTTGTTAAATCTAATAATAGTCAATCTAAACTTAAACATGACGGATATGATATAGATTACTATGATCCATCTCTCTTAGTTGACAATGATTTTTATTGTAGTTTTATTGAGAGTTTCTCACATTCTTGTTATCAGGATAGTATTGTAGATATTTGGAAAAATGATGAAATGTTAATAAAAAATCTAACAAAGTCTGATATTATTAAAAATGTTAATGAGGTGAAAATAAATCCTGTTACTGGCCATTCTGTTGATTATACAAAGCAGTTAGGGGGAGTTGAGCGTGATGAAAATGGTCTAATAGTGTCAGCCAAGTCAGTACTGATTACTTGGTATATGTATGTTAATATGTCAGAGGTTGATCTCAATGAAGTAGGAAACTTAGTTGGTACAGAAGACTGGGTGACAGTCCCATTAGCCATGTGGGAGAAAAAATATTTAAAATATGTGAGTAATTTATCATCACCAAAAAATATAAAGTTTTTTTATGAAACAGGAGGAAGTTTTGCAGATATAAGTGGTGAAACAATGTTTAATGACATGGATAAACTATCTATAGGTATTATGTTAATGTTTTTTTATGTTGTGATGGCAGTTTCTCGCTTCAATTGGCTAGAGATTAGGTTGACATTAGGTGGTGTAGGTTTACTGAGTGTTGGTATGGCATATATTACTACTGTGGGCTGGTGTTCCTTGATTGGTATCCCATTTGGCCCTGTTCACTCATCATTACCATTTCTTCTTATGGGCCTGGGAGTAGATGATATGTTTGTGATGAATGCATGTTGGAAAATAGTTTTGCAATCAGAGTCACACCGAAGTATTCCTGTTAAAGTAGGTCATATGCTGAAGCATGCAGGTGTGTCAATAGTAATAACATCCTTCACAGATATTGTTGCTCTATTGATAGGTGCCATAACAATTCTTCCTTCTTTGAAATCCTTCTGTATCTATGCTGCAGTTGGTGTATTTTTCATATTTTGTTATTCTGTCACTTTTTTTGTTGCAGTTTTCACAATAGACATAAAAAGGATTCGTGATAAGAGAAATGGAATTATATTCTGTTATAAACATAATAATGATGTCAATGTATCATCAAAAACTACATTTTTCCAAAAGATTTTAGAAAGTTTCTATAAAAATATTGTTTTTACTATTCCTGGTAAAGCCACAGTCATCTTATTTGTTTTAATAGTAACAGGTGTTAATATAGCAGCTGTATTAAAATTGGAACAAAAGTTTGACCAAAAGTGGTTTATTCCTGATGATACTTATTATAAACAATTTTTGAACACCCATGAGCACTACTATCCTGATGAAGGTTATCCAGCTATGGTTTTCTTAGGAGATATGGATTACTATAAAGAATTTAATAATTTGTACAATATGATACAGGTTTTACGGAATGAATCATATGTTACTGATGTTGTCACATGGGTAGAAACTTTTCATGGATATGTCTTAAAGAATTTTAACCACAACTTACTGAATTCAAGTTCTATTACAGAAGGCCAATTTCTAAATTATTTGTCCAGATTTATATACAGTGGAGTTGGAGGTAGATTTCAAGTAAATTTTAAATTTTCGGGGCCACATGCTTGTGGTAAAACTATTGATAATATAAGGGCCACAACATTATCTTTTAGATTCACAAGTTTCAAGGGTCCTCAGGAGTATATACCCGCAATGAATCATGTTAAAGACATTGTAAAATCTGCATCCATAGCTACTGGTGATGGTTACCGGAGCGTCTGGTCCAAGGCCTTCGCAAATTGGGTCACTGACGAGATTATAGCTGTTGAAGTGGAGAGAAACATAGAACTAGCATTGCTTTGTGTCATGCTCTGCACTGTGATATTAATTACAAATCTTCAAATGTGTTTATGGATATTCATTTGCGTTTTACTCACAATTGTAAATGTATTAGGAGGAATGCAACAGTGGGGTATGACAGTTGATATCGTGTGTTGCATTGGTCTAGAACTTGCAATTGGTCTTTGTGTTGATTATGCTGCACACGTTGGGCATACATTTTTAACTATGACCCAAGGCGATCGTGGCGAGAGAGCATACAACACAGTCACATCTATCGGCAGCGCAGTTCTGCTAGGCGGTGGTTCGACTTTCCTTTCTTTATCTCTTCTAAGTATGTCGAAAGCGTATACATTTCAATCTTTCTTTAAGATATTTTTGCTGGTAATACTATTTGGTTTATTTAATGGCTTGTTATTTCTACCTGTCGTTTTATCATTAATAGGTCCAGCACCTTACAAAAGTCGCGATGAAAACGTTTTGGAAGCTATAGAACTAAATGGTAAAACTCCTGACAATAAAAAAATGTTAGCTAAGCGAGTAGAGTCTTGA

Protein sequence:

>DPOGS208892-PA
MKLWVPPESDFYYDTNWYIDNFGTSLRMQKLLITADNVLDPHVIHLLSNITNEVSSIQIHYNNRTYSKNDLCFKVPVVAFVSPNWKARSEVLVKSNNSQSKLKHDGYDIDYYDPSLLVDNDFYCSFIESFSHSCYQDSIVDIWKNDEMLIKNLTKSDIIKNVNEVKINPVTGHSVDYTKQLGGVERDENGLIVSAKSVLITWYMYVNMSEVDLNEVGNLVGTEDWVTVPLAMWEKKYLKYVSNLSSPKNIKFFYETGGSFADISGETMFNDMDKLSIGIMLMFFYVVMAVSRFNWLEIRLTLGGVGLLSVGMAYITTVGWCSLIGIPFGPVHSSLPFLLMGLGVDDMFVMNACWKIVLQSESHRSIPVKVGHMLKHAGVSIVITSFTDIVALLIGAITILPSLKSFCIYAAVGVFFIFCYSVTFFVAVFTIDIKRIRDKRNGIIFCYKHNNDVNVSSKTTFFQKILESFYKNIVFTIPGKATVILFVLIVTGVNIAAVLKLEQKFDQKWFIPDDTYYKQFLNTHEHYYPDEGYPAMVFLGDMDYYKEFNNLYNMIQVLRNESYVTDVVTWVETFHGYVLKNFNHNLLNSSSITEGQFLNYLSRFIYSGVGGRFQVNFKFSGPHACGKTIDNIRATTLSFRFTSFKGPQEYIPAMNHVKDIVKSASIATGDGYRSVWSKAFANWVTDEIIAVEVERNIELALLCVMLCTVILITNLQMCLWIFICVLLTIVNVLGGMQQWGMTVDIVCCIGLELAIGLCVDYAAHVGHTFLTMTQGDRGERAYNTVTSIGSAVLLGGGSTFLSLSLLSMSKAYTFQSFFKIFLLVILFGLFNGLLFLPVVLSLIGPAPYKSRDENVLEAIELNGKTPDNKKMLAKRVES-