Monarch geneset OGS2.0

DPOGS212368
TranscriptDPOGS212368-TA1728 bp
ProteinDPOGS212368-PA575 aa
Genomic positionDPSCF300019 + 278116-280064
RNAseq coverage995x (Rank: top 13%)
Annotation
HeliconiusHMEL0101870.093.09% 
BombyxBGIBMGA004639-TA2e-16887.16% 
Drosophilafz2-PA0.059.45% 
EBI UniRef50UniRef50_B4KYW70.061.31%GI13462 n=7 Tax=Diptera RepID=B4KYW7_DROMO
NCBI RefSeqXP_001868222.10.066.24%frizzled-2 [Culex quinquefasciatus]
NCBI nr blastpgi|1700667840.066.24%frizzled-2 [Culex quinquefasciatus]
NCBI nr blastxgi|1700667840.066.06%frizzled-2 [Culex quinquefasciatus]
Group
Gene OntologyGO:00160208.3e-127membrane
GO:00071668.3e-127cell surface receptor linked signaling pathway
GO:00055151.5e-70protein binding
KEGG pathwaycqu:CpipJ_CPIJ0180480.0 
 K02375 (FZD5_8, fz2)maps-> Basal cell carcinoma
    Pathways in cancer
    Wnt signaling pathway
    Melanogenesis
InterPro domain[14-558] IPR0155263.1e-263Frizzled-related
[252-558] IPR0005398.3e-127Frizzled protein
[57-176] IPR0200671.5e-70Frizzled domain
Orthology groupMCL11478 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212368-TA
ATGAACCTGCAACAGAAGTGCGTGCCAGCTTTCTTTGTTCGCTTCCACCATTTAAATATGTTTAAGAAATTGTCGAAAGCTAAGGACTATGCGGTTAAACTAGTGATTATGATGTGGCGTGTGGTGTCGCTGGTGCTGGTGCTGGCGGGGGCGGAGGCTCTCCAGCCGCGCTGCGAGGAGATTACTATACCCATGTGTAGGGGCATCGGATACAATCTTACGTCTTTCCCTAATGCTCTCGACCACGACACACAAGACGAAGCCGGCCTAGAGGTCCATCAATATTGGCCGCTGGTGGAGATAAAATGTTCAGCCGACCTGAAGTTCTTCCTGTGTTCCGTGTATACTCCGATATGTATAGAAGACTACGCCAAGCCTCTGCCGGCATGTCGCAGTGTCTGCGAGCGTGCGCGGGCGGGCTGCGCACCTCTCATGCAGAAATACGGTTTCCAGTGGCCCGAGCGTATGGCGTGCGAGAAGCTACCGCGTCTCGGGGACTCCGAGCACCTGTGTATGGAGGAACCGGACCGCGCACAGGAACCCGAACCGCCGCGCCCCCCGCCGCGTCGTCCATACAAAAACTGCAAGGATCCAAAAAACTGCGAAAGCGGCCCGGCTGCGAGCCCGGGCGAGGCGGGCGGCGACGAGTGCGCGTGCGCATGCCGGCCGCCCCTCGTCAGCGTGCGCGCGCTGCACAACGCCAGCGCCACGGCGGCCGGCGTGCCCGGCTGCGCGCTGCCGTGCCGCGGGGCCTTCTTCACAAGGGAGGAGAAGGAGTTCGCGGCCGTGTGGGTCGCGCTGTGGGGCGGTCTGTGCGCCGCCTCCACACTCATGACGCTCACCACCTTCCTCATTGACTCGCAGCGGTTCAAGTACCCCGAGCGACCGATCGTGTACCTCTCCGCCTGTTACTTCATGGTGGCCCTCGGGTACCTGGCACGGCTCGTCATCGGCCACGACGAGGTGGCCTGCGACGGCGCTCTCCTTAAGACCTCCTCCAACGGCCCCAGCGCCTGCACGCTCGTGTTCATTCTGGTTTACTTTTTCGGCATGGCGTCCTCTATCTGGTGGGTGGTGTTGTCGTTCGCGTGGTTTCTAGCGGCGGGTCTCAAGTGGGGCAACGAGGCGATCGCCGGTCATGCTCAGTACTACCACCTGGCAGCGTGGCTCGTGCCGGCCGCTAAGACCGTAGTGGTGCTGCTCGCGGGAGCCGTGGACGGAGACCCCGTGGCCGGCGTGTGCTACGTCGGCAACACCTCCTCCGAGAACCTCAAGAAGTACGTGCTAGCGCCGCTCGTGGTGTACTTCGCATTAGGCGCCACGTTCCTGTTGGCCGGTTTCGTGTCGCTGTTCCGCATCCGCTCCGTGATCAAGCGGCAGGGCGGCATCGGCGCCGGCTCGAAAGCTGACAAACTCGAGAAATTGATGATCCGCATCGGCGTCTTCAGCGTGCTGTACGCGGTGCCGGCCGGCGTCGTGATAGGCTGCCTCGCCTACGAGGCGGAGGGTCGCGAGCAATGGCTGCGGCGCGTGGCGTGCGGAGCCTCGTGTGGACCGCGTCCTCTCTACTCAGCGCTCATGCTCAAGTACTTCATGGCGCTGGCGGTGGGCATCACGTCCGGAGTCTGGATCTGGTCCGGCAAGACGCTGGAGTCGTGGCGACGAGTGTGGCGCGGTGGGAGGGCTCCTCCCCCCGCACACCGCGCGCTCGTGAAGGGGGCCGTGTGA

Protein sequence:

>DPOGS212368-PA
MNLQQKCVPAFFVRFHHLNMFKKLSKAKDYAVKLVIMMWRVVSLVLVLAGAEALQPRCEEITIPMCRGIGYNLTSFPNALDHDTQDEAGLEVHQYWPLVEIKCSADLKFFLCSVYTPICIEDYAKPLPACRSVCERARAGCAPLMQKYGFQWPERMACEKLPRLGDSEHLCMEEPDRAQEPEPPRPPPRRPYKNCKDPKNCESGPAASPGEAGGDECACACRPPLVSVRALHNASATAAGVPGCALPCRGAFFTREEKEFAAVWVALWGGLCAASTLMTLTTFLIDSQRFKYPERPIVYLSACYFMVALGYLARLVIGHDEVACDGALLKTSSNGPSACTLVFILVYFFGMASSIWWVVLSFAWFLAAGLKWGNEAIAGHAQYYHLAAWLVPAAKTVVVLLAGAVDGDPVAGVCYVGNTSSENLKKYVLAPLVVYFALGATFLLAGFVSLFRIRSVIKRQGGIGAGSKADKLEKLMIRIGVFSVLYAVPAGVVIGCLAYEAEGREQWLRRVACGASCGPRPLYSALMLKYFMALAVGITSGVWIWSGKTLESWRRVWRGGRAPPPAHRALVKGAV-