Monarch geneset OGS2.0

DPOGS208131
TranscriptDPOGS208131-TA3174 bp
ProteinDPOGS208131-PA1057 aa
Genomic positionDPSCF300154 + 330633-336745
RNAseq coverage386x (Rank: top 31%)
Annotation
HeliconiusHMEL0062180.089.21% 
BombyxBGIBMGA006572-TA0.078.33% 
DrosophilaPtr-PA0.063.43% 
EBI UniRef50UniRef50_Q86P360.063.43%Ptc-related n=28 Tax=Arthropoda RepID=Q86P36_DROME
NCBI RefSeqXP_001653012.10.070.86%hypothetical protein AaeL_AAEL001299 [Aedes aegypti]
NCBI nr blastpgi|1571172850.070.86%hypothetical protein AaeL_AAEL001299 [Aedes aegypti]
NCBI nr blastxgi|1571172850.070.86%hypothetical protein AaeL_AAEL001299 [Aedes aegypti]
Group
Gene OntologyGO:00160209.9e-196membrane
GO:00081589.9e-196hedgehog receptor activity
KEGG pathwayspu:5798872e-50 
 K12385 (NPC1)maps-> Lysosome
InterPro domain[53-856] IPR0033929.9e-196Patched
Orthology groupMCL10591 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208131-TA
ATGGGTTGCAAGCTAACATTTGTCGATGACATCTTAAACCGGTCGTTCTATAAGCTTGGTCTGGTCGTCGGCAAGCAGCCGGGGTACTTCATCATCATACCAGTTCTCTTAACCTTGCTCATGGTCACTGGCTACCAGCGGGTGCACTACGAAATGGATCCAGAATACCTTTTCTCCCCAGTCAGCGGTCAAGGGAAATTGGAAAGGAGTATCGTTGAAGAACATTTCAAAGTAAATTACTCCCATAGATTCAATGTTGGCAGAGTAACAAGAGCAGGTAGGTTCGGGCGAGTGATCATCATTGCGAAGGACAATCAAACCAACCTGCTGCGGACAGAAGTTTGGAAAGAGTTAAGACAACTGGATGAATATGTTCAGAATATAACTGTGACTTTGGAAGACGGTGAAACATTCACGTATAAAGAGGAATGCGCGCGATGGGAAGGACAATGCTTCGTCAACGATATTCTAAATTTGGATAAGATTATTGGAGAGGTTGAACGGGGCGAATTGAACTTGACGTTTCCGATAATGTTCAATCCTGTCACGTGGGAGGCTCATGCTTTTCCTGTTTATTTCGGCGGTTCAACTGTGGTAGATGACACTATAGTATCGGTGCCCGCCGTTCAGCTTGTATGGTTCATCAGGACTGACACTAAACTGCAGCAGCAACGAGGTGCAGCTTGGGAAGATGCGTTTCTGGATGCTGTCGGTGTAGTAGAAGATACAGGACGATTTAAACATATCTCGATAGCGCGATTCGCGTCTCGAACATTAGATCATGAACTTGAGAAGAACACGAGAACTGTCATACCATTCTTTAGCTCCACATTTATCCTTATGGGGATATTTTCAATAGTGACGTGTATGATGGGAGACTGGGTCAGATCCAAACCCTGGCTGGGTTTACTTGGAAATATATCAGCCGTGATGGCAACGATCGCTGCTTTCGGTTGTGCCATTTATTTAGGCATTTCTTTCATCGGTATAAACCTTGCCGCACCGTTCTTAATGATTGGAATCGGTATTGATGACACGTTCGTGATGTTGGCTGCCTGGCGGCGCACGTCCCCCCGTTTACCCGTCCCGGAGCGGATGGCTATCATGCTGTCAGAAGCTGCCGTCTCTATCACTATCACTTCGGTCACTGATATGCTGTCCTTCTTCATAGGCATCTTTTCACCATTCCCCTCCGTTCAAATATTTTGCATGTATTCAGGCCTTGCGGTTTGTTTTACGTTTGTATGGCACCTAACATTTTTCGCCGGATGCGTGGCCGTGTCCGGATACAGGGAGAAAAACAATCGTCACACTATTACGTGGTTGAAAGTATTGCCGGAATCTAGAGCTAGGAAAGAAGAGAAATCATGGTTATATAGAATTTTCTGCAGTGGTGGCATCGATCAGGCTGATCCAGACAACCCGATAGACAACAAAGAACATTGCATTATGGCCTTCTTCCGCACTACTATGGCGAATTTGCTCAATAATAGCTTCGTGAAAGCTTTAGTCATACTTATATTCCTAGGATACTTAGCTGGCGCCGGATATGGAGTGACGAATCTAAAAGAAGGTTTAGAAAGAAGGAAGCTGTCAAAAGTTGATTCTTATTCTGTGGAATTTTTTGACCGAGAGGATTTATATTACAGGGAATTTCCCTATCGAATTCAGGTTGTTATAAGCGGTAAATACAATTACTCCGATCCTAAAATCCAGGATGAAGTTGAGATTTTGACACAGAGATTAGAAAATACTTCATACATATCGAATTCTTTGTATACCGAATCCTGGTTGCGGACTTTCGTGAATTATGTTGAGAGAAACAACGATTATCTCAACATATCAATCGATTCTGAGGAAGACTTTATTAAGAATCTTAAAGAGTTGTGGCTGTTTTCGGCAAATCCATTTTCGCTCGACGTGAAATTCAATAAGGAGGGAGACCAAATTCTTGCATCTAGATTTCTTATTCAAGCTATCAATATTAGTGGAACTAACCACGAAAAGGAAATGGTTAAAGCTCTTAGAGAAGTCGTTGCCCAATCTCCACTCAACGCTACCGTATTTCACCCTTATTTCGTGTTTTTCGATCAGTTCGAGCTCGTGAGACCTACATCTTTGCAAAACCTGTGCTATGGAGCTTTGATGATGATGATAACTTCCTTTATATTCATACCCAATATACTGTGCTCATTGTGGGTGGCTTTCAGTATAATATCCATAGAAATTGGAGTAGTCGGTTATATGGCCCTATGGGATATTAATCTGGACTCAATATCAATGATAAACCTCATAATGTGCATTGGCTTCTCGGTTGACTTCACTGCACACATTTGTTATGCTTATATGGCGTCCAAAGCTAAGTATCCCAGAGAAAGGGTGAGCGAATGTCTCTACTCGTTAGGATTGCCTATTGTTCAAGGATCATTCAGCACGATATTAGGAGTTGTTGCATTACTACTCGCAGATAGTTATATCTTCTCGGTATTCTTTAAAATGGTATTCATGGTCATTTTCTTCGGTGCCATGCATGGTTTATTCCTCCTACCAGTTCTTTTGTCCCTCTTTGGTCCAGGGTCGTGTACAAGGGAAACGAAAGAGATAAAAATAGCAAAAGTAGACAAGATTTTCCCTAATCCGTATTGCTTACCGCATCCTCAATTGGTTCTGAATGATCAAATTTATAATGGGAAGAATATAAATCCAAACGGTATTTACAAAATATATGGAGACGACAAGGATCTTGGAATTGGCACGTCCGGTGAAGATACTAGTGAGAGCAGTTCGAATCAATCACAAAGACGTCAAATTAGCAGCGACGAAAACAGCAGAAAGAATTACGAAGACGGATGGAAGAAATTCGGCTATCATCAGAGCACAAGTCAATTTCAACCGTCAGGGGAGCTGGATTTGTATGAGCACGATCATGATAAGGCTTGGCAAAGACAACGCAACTATCGCAGTCAAGATAGTTACAAAAGACCAAGCCATAGAGATGGAGATTTTATAAGAACCAGAAAAACTAGCGATGCCGTTCCAGCCAATGAAGGGACATACAAGGTGATGAGAACCCACTCCCATCACAACCTTCACAGGCCTCGAGCTCCCAGACGAACAAACTCAACCCAGAATCTCGAGCACATTAACTACGTCGGAGAAATGCGCTTTCCTTGA

Protein sequence:

>DPOGS208131-PA
MGCKLTFVDDILNRSFYKLGLVVGKQPGYFIIIPVLLTLLMVTGYQRVHYEMDPEYLFSPVSGQGKLERSIVEEHFKVNYSHRFNVGRVTRAGRFGRVIIIAKDNQTNLLRTEVWKELRQLDEYVQNITVTLEDGETFTYKEECARWEGQCFVNDILNLDKIIGEVERGELNLTFPIMFNPVTWEAHAFPVYFGGSTVVDDTIVSVPAVQLVWFIRTDTKLQQQRGAAWEDAFLDAVGVVEDTGRFKHISIARFASRTLDHELEKNTRTVIPFFSSTFILMGIFSIVTCMMGDWVRSKPWLGLLGNISAVMATIAAFGCAIYLGISFIGINLAAPFLMIGIGIDDTFVMLAAWRRTSPRLPVPERMAIMLSEAAVSITITSVTDMLSFFIGIFSPFPSVQIFCMYSGLAVCFTFVWHLTFFAGCVAVSGYREKNNRHTITWLKVLPESRARKEEKSWLYRIFCSGGIDQADPDNPIDNKEHCIMAFFRTTMANLLNNSFVKALVILIFLGYLAGAGYGVTNLKEGLERRKLSKVDSYSVEFFDREDLYYREFPYRIQVVISGKYNYSDPKIQDEVEILTQRLENTSYISNSLYTESWLRTFVNYVERNNDYLNISIDSEEDFIKNLKELWLFSANPFSLDVKFNKEGDQILASRFLIQAINISGTNHEKEMVKALREVVAQSPLNATVFHPYFVFFDQFELVRPTSLQNLCYGALMMMITSFIFIPNILCSLWVAFSIISIEIGVVGYMALWDINLDSISMINLIMCIGFSVDFTAHICYAYMASKAKYPRERVSECLYSLGLPIVQGSFSTILGVVALLLADSYIFSVFFKMVFMVIFFGAMHGLFLLPVLLSLFGPGSCTRETKEIKIAKVDKIFPNPYCLPHPQLVLNDQIYNGKNINPNGIYKIYGDDKDLGIGTSGEDTSESSSNQSQRRQISSDENSRKNYEDGWKKFGYHQSTSQFQPSGELDLYEHDHDKAWQRQRNYRSQDSYKRPSHRDGDFIRTRKTSDAVPANEGTYKVMRTHSHHNLHRPRAPRRTNSTQNLEHINYVGEMRFP-