Monarch geneset OGS2.0

DPOGS214293
TranscriptDPOGS214293-TA822 bp
ProteinDPOGS214293-PA273 aa
Genomic positionDPSCF300014 + 2340159-2341727
RNAseq coverage22x (Rank: top 79%)
Annotation
HeliconiusHMEL0114361e-14191.44% 
BombyxBGIBMGA006004-TA6e-11485.07% 
DrosophilaWnt6-PB2e-7258.00% 
EBI UniRef50UniRef50_E2A0L77e-9157.69%Protein Wnt n=3 Tax=Camponotus floridanus RepID=E2A0L7_CAMFO
NCBI RefSeqXP_001603351.13e-9361.45%PREDICTED: similar to wingless-related MMTV integration site 6 homolog [Nasonia vitripennis]
NCBI nr blastpgi|3227886694e-9362.50%hypothetical protein SINV_15820 [Solenopsis invicta]
NCBI nr blastxgi|3227886691e-9762.50%hypothetical protein SINV_15820 [Solenopsis invicta]
Group
Gene OntologyGO:00072753.8e-134multicellular organismal development
GO:00160553.8e-134Wnt receptor signaling pathway
GO:00055763.8e-134extracellular region
GO:00051023.8e-134receptor binding
KEGG pathwaynvi:1001196138e-93 
 K00445 (WNT6)maps-> Basal cell carcinoma
    Pathways in cancer
    Wnt signaling pathway
    Melanogenesis
    Hedgehog signaling pathway
InterPro domain[17-273] IPR0058173.8e-134Wnt
Orthology groupMCL13262 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214293-TA
ATGCTTCAGGATTTTATTTCTTTAGCCCCGGAGCGTGTGTCGAATCATACAAGAGAAACTGGGTTTGTGAATGCAATAACAGCCGCCGGCGTTACTTACGCTATTACACGGGCGTGTACAGCTGGTTCCCTCCTTGAATGCTCATGTGAAAAGGAAATTCCAAAACCTAGAAGAGGACGTGTTACTCAAGTACCCCAGCCGCCATCTCCAGTGCAAAAGGATAAATGGCAATGGGGTGGTTGTAGCGATAATGTTCGCTTTGGTCTACAGAAATCCAGAGAGTTTATGGACAGTCGATATAGGAAAAAAAGCGATATAAAAACATTGATTAAATTACACAACCACAATGCTGGGAGGTTGGCAATTAAAAATAATATGAAAGTGGAATGTAAATGTCATGGACTATCAGGGTCTTGCACGCTTCGTACATGTTGGTGGAGAATGCCAACGTTTAGAGAGGTGGGCGACCGTTTGAGAGATAAATTTGAGGGTGCAGCTAAGGTGATTTCAAATAATGACGGTGATAACTTTATGCCAGAAAGTCCAAATATCAAACGGCCTGGTAAGAAGGATATCATTTACTCTGAAGAATCACCCGATTTTTGTACATTTAATATGAAGACTGGATCTCTGGGAACAGAAGGGCGACAGTGTAACGTAAGTTCCGCTGGAACCGACAGCTGTGATCAACTTTGTTGTAGGAGGGGATACGTGCAAAATACCATCAGAGAAGCCGAAAATTGTAATTGTCAATTTAAATGGTGTTGCGAAGTGATTTGTGAGACTTGCTACGTTAAAAGAGATATACAAACGTGCCTTTAA

Protein sequence:

>DPOGS214293-PA
MLQDFISLAPERVSNHTRETGFVNAITAAGVTYAITRACTAGSLLECSCEKEIPKPRRGRVTQVPQPPSPVQKDKWQWGGCSDNVRFGLQKSREFMDSRYRKKSDIKTLIKLHNHNAGRLAIKNNMKVECKCHGLSGSCTLRTCWWRMPTFREVGDRLRDKFEGAAKVISNNDGDNFMPESPNIKRPGKKDIIYSEESPDFCTFNMKTGSLGTEGRQCNVSSAGTDSCDQLCCRRGYVQNTIREAENCNCQFKWCCEVICETCYVKRDIQTCL-