Monarch geneset OGS2.0

DPOGS202059
Transcript         DPOGS202059-TA  1308 bp
Protein            DPOGS202059-PA  435 aa
Genomic position   DPSCF300053 + 882302-890694
RNAseq coverage    66x (Rank: top 67%)
Annotation
Heliconius         (no values listed)
Bombyx             BGIBMGA012535-TA        3e-85  69.81%
Drosophila         hh-PA                   2e-37  39.72%
EBI UniRef50       UniRef50_UPI000223F137  3e-39  75.53%  UPI000223F137 related cluster n=1 Tax=unknown RepID=UPI000223F137
NCBI RefSeq        XP_001605475.1          6e-41  78.35%  PREDICTED: similar to hedgehog [Nasonia vitripennis]
NCBI nr blastp     gi|4176772              5e-43  86.60%  hedgehog protein [Junonia coenia]
NCBI nr blastx     gi|309253979            2e-41  87.76%  hedgehog [Bicyclus anynana]
Group
Gene Ontology      GO:0008233  8.5e-62  peptidase activity
                   GO:0006508  8.5e-62  proteolysis
                   GO:0007275  1.7e-47  multicellular organismal development
                   GO:0007267  1.7e-47  cell-cell signaling
                   GO:0007154  2.6e-37  cell communication
KEGG pathway       nvi:100121866  2e-40
                   K06224 (HH)  maps to Hedgehog signaling pathway
InterPro domain    [237-429]  IPR001767  8.5e-62  Peptidase C46, hedgehog protein, hint region
                   [15-105]   IPR009045  6.5e-50  Hedgehog/DD-peptidase
                   [15-101]   IPR000320  1.7e-47  Hedgehog, N-terminal signaling domain
                   [14-32]    IPR001657  2.6e-37  Peptidase C46, hedgehog protein
                   [240-340]  IPR003587  2.9e-21  Hedgehog/intein hint, N-terminal
Orthology group    MCL10543  Multiple-copy universal gene
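
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and rests on two assumptions not stated in the record: that the transcript length covers only the coding sequence plus one stop codon, and that the genomic and domain coordinates are 1-based and inclusive. All variable names are hypothetical.

```python
# Values copied from the record; variable names are illustrative only.
transcript_bp = 1308
protein_aa = 435
genomic_start, genomic_end = 882302, 890694

# Assumption: the transcript is coding sequence only, so N residues plus
# one stop codon should occupy 3 * (N + 1) bases.
expected_bp = 3 * (protein_aa + 1)

# Assumption: genomic coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1

# InterPro domain intervals on the protein as (start, end) residue positions.
domains = {
    "IPR001767": (237, 429),
    "IPR009045": (15, 105),
    "IPR000320": (15, 101),
    "IPR001657": (14, 32),
    "IPR003587": (240, 340),
}
domain_lengths = {acc: end - start + 1 for acc, (start, end) in domains.items()}

# Every domain should end at or before the last residue of the protein.
domains_fit = all(end <= protein_aa for _, end in domains.values())

print(expected_bp == transcript_bp)  # True
print(genomic_span)                  # 8393
print(domain_lengths["IPR001767"])   # 193
print(domains_fit)                   # True
```

Under these assumptions the genomic span (8393 bp) is several times the transcript length (1308 bp), which would be consistent with the gene containing introns.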
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202059-TA
ATGCAGAGTAAACACCAAGCATCCCGTCTCTCAGACCTGCCCGTCGAATCGAGATGCAAGGAGAAGTTGAATACTCTCGCTATCAGCGTGATGAATCAATGGCCAGGGGTTAGGCTGAGGGTGATCGAGGGTTGGGAGGAAGAAAACTCCCACCTGGAGAACTCCCTCCACTATGAGGGCCGGGCCGTTGACCTCACCACCAGCGATCGCGACCGCAGCAAGTACGGCATGCTGGCTAGACTAGCTGTTGAAGCCGGCTTCGACTGGGTCTTCTACGAAAGCCGCTCCTACATCCATTGTTCTGTTAAGACAGCAGTCTCCGATTTTTTGGAGTCCAAAATGAATGGGTACATAGCAATTGAGAGTATGGACCAGATCTTAAAGGCCAAATATATAGTGAAGGTTTACAAGAAGAATGCACCTCGGACAGCTTCCCGCGCCCGCTCTCAGGCGGTGTCGATCAAACGGACGAACAAGCCTCTTGTTAACTACCGTCCCACTCTAACATCATATAAAACATATGTCCATCCCATTCTGACACAGCCGCATCGCAGCCAATCCCCCACTTCTATAATTACGTCGGCGCCGCCTAGCGCTGCCGGTGGTCCGTCGGGGCGGAGTCCTACGGGCCAGTACAAGCGACCCTCCGCTCCCGCCTCCGCTCCCGCTTCTCTTATCATCACATCATCAGGCTTACAATCTTCCGTAGGAACAGGAGCAGGCTGCTTTCCTTCTGGTTCTTTGGTCCATACAGAAAAAGGACCTAAAAACATCGACTCTCTTCAAAAGGGTGATCGAGTTTTAGCGGCCGATAGTGATGGAAAGTTAGTGTATAGTGAGGTTTTGACTTTCATTGATCGAGATCCGAACGCTGTACGTCAATACATCGAATTGACGGCCGAAAATAATGCTACTATTACGACGACGCCATCCCACCTCTTGCTGCTAGCCGCTGCTGATGGTTGGCGGGAATCATTTGCTGATAATGTAGAGATTGGCGATTTCCTTCTAACAAGAGGACAGGGAAGCGTGATGCGACCTTCAAGAGTTGTTAACATTAGAAGGGTTTCTAAACTTGGGGTATTTGCACCTCTAACGAGAACTGGAACTATCATTGTGGACGATGCCTTAGCGTCCTGCTATGCTCTCATCAATAGTCATTCCATTGCCCACGCTGCCATGGCACCCTTGAGGTGGTTGGCCCAATGGAACAGAACACCTGAAGTTAAACGCGGTGTCCATTGGTATGCTAACGCTCTATATAATATCGGCGATTTCGTGCTACCCTCATCGTACAAGTATCGCTAA

Protein sequence:

>DPOGS202059-PA
MQSKHQASRLSDLPVESRCKEKLNTLAISVMNQWPGVRLRVIEGWEEENSHLENSLHYEGRAVDLTTSDRDRSKYGMLARLAVEAGFDWVFYESRSYIHCSVKTAVSDFLESKMNGYIAIESMDQILKAKYIVKVYKKNAPRTASRARSQAVSIKRTNKPLVNYRPTLTSYKTYVHPILTQPHRSQSPTSIITSAPPSAAGGPSGRSPTGQYKRPSAPASAPASLIITSSGLQSSVGTGAGCFPSGSLVHTEKGPKNIDSLQKGDRVLAADSDGKLVYSEVLTFIDRDPNAVRQYIELTAENNATITTTPSHLLLLAAADGWRESFADNVEIGDFLLTRGQGSVMRPSRVVNIRRVSKLGVFAPLTRTGTIIVDDALASCYALINSHSIAHAAMAPLRWLAQWNRTPEVKRGVHWYANALYNIGDFVLPSSYKYR-
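
As a quick consistency check between the two sequences, the sketch below translates the first 30 and last 15 bases of DPOGS202059-TA with the standard genetic code. It assumes the standard code applies and that the transcript is in frame from its first base. The `translate` helper is hypothetical, written for this example only. It writes the stop codon as `*`, whereas the protein record uses `-`.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 bases copied from the DPOGS202059-TA record.
first_30 = "ATGCAGAGTAAACACCAAGCATCCCGTCTC"
last_15 = "TACAAGTATCGCTAA"

print(translate(first_30))  # MQSKHQASRL
print(translate(last_15))   # YKYR*
```

Both fragments agree with the start (`MQSKHQASRL`) and end (`YKYR-`) of DPOGS202059-PA. Passing the full nucleotide sequence to the same function would extend the check to the whole protein.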