Monarch geneset OGS2.0

DPOGS206421
Transcript        DPOGS206421-TA   1074 bp
Protein           DPOGS206421-PA   357 aa
Genomic position  DPSCF300181 + 103906-110207
RNAseq coverage   38x (Rank: top 73%)
Annotation
Heliconius      HMEL018100        2e-158  94.24%
Bombyx          BGIBMGA013787-TA  2e-143  84.80%
Drosophila      Wnt2-PA           4e-63   40.43%
EBI UniRef50    UniRef50_E2A1W9   2e-105  60.76%  Protein Wnt n=11 Tax=Protostomia RepID=E2A1W9_CAMFO
NCBI RefSeq     XP_002423103.1    2e-129  65.85%  protein Wnt-4 precursor, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|378940491      7e-156  94.58%  WntA signaling ligand, partial [Heliconius erato lativitta]
NCBI nr blastx  gi|378940491      1e-170  94.58%  WntA signaling ligand, partial [Heliconius erato lativitta]
Group
Gene Ontology     GO:0007275  1.7e-160  multicellular organismal development
                  GO:0016055  1.7e-160  Wnt receptor signaling pathway
                  GO:0005576  1.7e-160  extracellular region
                  GO:0005102  1.7e-160  receptor binding
KEGG pathway      cfa:612367  9e-87
                  K00408 (WNT4) maps->
                    Basal cell carcinoma
                    Pathways in cancer
                    Wnt signaling pathway
                    Melanogenesis
                    Hedgehog signaling pathway
InterPro domain   [44-357]  IPR005817  1.7e-160  Wnt
Orthology group   MCL17050  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206421-TA
ATGCGCTTCCCTGGTGCACGTGCGGCACTCAAGGTCTCCCAACCCTTTACACATCCTCTCAGGAACCTTGCAGCTCCGGTTAAGACGACACAGTCATCAAACACGTCCTTGGAGACGTACTCAACATTGCAGAAGGAAGCCTGCCACAGGTTGGATTTCTTGGTTGAGCGTCAGAAACAATTATGTATGCTATCCGACAAGATGGTTCAGGTGTTACAAACCGGGGCCCAGCAGGCCGTCGAGGAATGCCAGTATCAGTTCAGGAACAGCCGTTGGAATTGCAGCACAGTTGAGAATTCAACTGACATCTTCGGAGGCGTACTTAAATTCAAATCACGTGAATCTGCATTTGTCCATGCGCTATCGGCTGCATCTTTAGCTCATGCTGTAGCCAGAGCTTGCAGTCGTGGAGAATTGAACGAGTGTTCTTGTGACGCCAGGGTGAGGAAACGCACTCCAAGACATTGGCAATGGGGTGGATGTTCGGAGGACATAAGATATGGTGAGAAATTCAGTCGTGATTTCGTGGACGCCAAAGAGGACAAAGAGTCTGACGAGGGGATCATGAATTTGCACAACAACGAGGCCGGTCGGAGGGCTGTCCGTGGCCGTATGCAGCGTGTATGCAAATGTCATGGCATGTCTGGTTCCTGTTCCGTACGCGTTTGTTGGCGTCGTCTACCGCCTCTGAGGGCAGTCGCTGATGCTTTATCCACCAGATACGAGGGAGCGAGTCATGTCAAGGTCGTTGAAAGAAAAAAAGGGAAGAACATAAGGAAGTTGCGACCAATCCATCCTGACATGAAGAAGCCCAATAAAACTGATTTAGTCTATTTGGAAGAATCACCAGATTACTGCGAGCCAAATGAAGAACTAGGCATTCTCGGTACGCGGTCCCGAACCTGCAACCGTACATCCGCTGGTTTAGACGGCTGCCGCTTGCTGTGCTGCGGACGCGGATATCAGACTCGTGTGCGAGATCACGAGGAGAAGTGCCGTTGCCGATTTGTGTGGTGCTGTCGCGTCCACTGCGAGATATGCCGCTCCAAACGAGACCACCACGTATGCAATTAG

Protein sequence:

>DPOGS206421-PA
MRFPGARAALKVSQPFTHPLRNLAAPVKTTQSSNTSLETYSTLQKEACHRLDFLVERQKQLCMLSDKMVQVLQTGAQQAVEECQYQFRNSRWNCSTVENSTDIFGGVLKFKSRESAFVHALSAASLAHAVARACSRGELNECSCDARVRKRTPRHWQWGGCSEDIRYGEKFSRDFVDAKEDKESDEGIMNLHNNEAGRRAVRGRMQRVCKCHGMSGSCSVRVCWRRLPPLRAVADALSTRYEGASHVKVVERKKGKNIRKLRPIHPDMKKPNKTDLVYLEESPDYCEPNEELGILGTRSRTCNRTSAGLDGCRLLCCGRGYQTRVRDHEEKCRCRFVWCCRVHCEICRSKRDHHVCN-
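
The transcript and protein lengths in this record agree with each other: 1074 bp is 358 codons, which is 357 amino acids plus one stop codon (the nucleotide sequence ends in TAG, shown as the trailing "-" in the protein sequence). The short Python sketch below illustrates that arithmetic. The function name `codon_count` is hypothetical and introduced only for this example; the two lengths are copied from the record.

```python
# Lengths copied from the record above.
transcript_bp = 1074
protein_aa = 357


def codon_count(n_bp):
    """Return the number of complete codons in a coding sequence of n_bp bases."""
    if n_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return n_bp // 3


codons = codon_count(transcript_bp)
# One of the codons is the terminal stop codon, so it is not counted as a residue.
print(codons, codons - 1 == protein_aa)
```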