Monarch geneset OGS2.0

DPOGS214989
Transcript        DPOGS214989-TA   1119 bp
Protein           DPOGS214989-PA   372 aa
Genomic position  DPSCF300256 - 198950-203012
RNAseq coverage   349x (Rank: top 34%)
Annotation
Heliconius        HMEL010172         3e-108   85.45%
Bombyx            BGIBMGA012189-TA   1e-116   78.65%
Drosophila        dsh-PA             1e-77    56.27%
EBI UniRef50      UniRef50_D2A430    1e-94    66.10%   Putative uncharacterized protein GLEAN_14903 n=1 Tax=Tribolium castaneum RepID=D2A430_TRICA
NCBI RefSeq       XP_967594.1        2e-95    66.10%   PREDICTED: similar to dishevelled [Tribolium castaneum]
NCBI nr blastp    gi|307192443       2e-96    61.22%   Segment polarity protein dishevelled-like protein DVL-3 [Harpegnathos saltator]
NCBI nr blastx    gi|383857521       2e-101   58.54%   PREDICTED: segment polarity protein dishevelled homolog DVL-3-like [Megachile rotundata]
Group
Gene Ontology     GO:0035556   5.1e-26   intracellular signal transduction
                  GO:0007275   7.1e-18   multicellular organismal development
                  GO:0004871   7.1e-18   signal transducer activity
KEGG pathway      tca:655946   6e-95
                  K02353 (DVL) maps-> Basal cell carcinoma
                                      Pathways in cancer
                                      Wnt signaling pathway
                                      Melanogenesis
                                      Notch signaling pathway
InterPro domain   [45-367]    IPR015506   7.3e-125   Dishevelled-related protein
                  [143-262]   IPR011991   9.8e-36    Winged helix-turn-helix transcription repressor DNA-binding
                  [158-232]   IPR000591   5.1e-26    DEP domain
                  [67-81]     IPR008339   7.1e-18    Dishevelled
Orthology group   MCL10548   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214989-TA
ATGTCTCGGGAACAAGTCGCTGCACTGGTGCAGTGTGTGGAGGCTGTGTTGAAGATTATTCACGTGTATATTGTTCTTAAATATTCCTCTCTGTTCCGAATTCCGGTTAACTCTGATGAAGTCGTAAAACTGGTGAATGATGTGAACTTTGAAGACATGACCAACGATGAAGCTGTGAGAGTTCTCCGGGAAGTGGTCCAGAAGCCTGGGCCCATCAAGCTGGTGGTAGCCAAGTGCTGGGACCCAAACCCCAAGGGCTACTTCACAATACCTCGGACAGAACCCGTCCGACCAATAGATCCAGGTGCGTGGGTCGCTCACACCCAGGCCTTGCGCGAGGCGTATCCACCTCCTCCGCTGTCATCGGTCCCGGCGTCCCTGCCGGAGCGGGCTTCGGACGCGGGGTCGCTAGCGGAGCCCCAGTTGTCCGTCGGGATGGATATGGCGCTCGTGGTCCGAGCCATGCTGAGGCCAGAATCTGGTCTGGAGATCAGGGATCGTATGTGGCTCAAGATAACCATCCCCAACGCGTTCATCGGCGCCGACGTGGTCGACTGGATCCTGCAGCACGTGGCCGGCATAGTGGACAGGAGGGACGCCAGGAAGTACGCCTCGCACATGCTCAAGGCTGGCTTCATCCGTCACACTGTGAATAAGATCACGTTCTCCGAGCAGTGTTACTACGTGGCTGGAGAGCTCTGTGCCGATATGGCCGCGCTCAGGATACGCAGCGCCGACCAGGACAGCCTGGCCTCCGATACACTAGCGCCTCTACCGAATCCCAACATAATGGGTCCGGGCTACATGCCTTACGCTGGCTCCTACGGCTACCAGCCCATACCCTTCAAGTACAGCTCGTGCCTCACCAGCGAGCACACGGTGTACGGATACAATCGCGAGGAGAGCGTGTTGTCTGGGAGCGGCGGGTCCAGCGCGGGCTCCGACCACCTCACTACCAAGGAACCCCCGGGTCCTAACCGCGAGAACGAGGTGAAGTCTACCTCCAGCGGTTCAGGTGCCAGTGCTGCGGCGGAGGGCGGGGGCGGAGGCACCAGGAGGTCACACTCACACTCCAGCGGCTCGGAGAGAGCCAACGACAGGCCCGTGCTCTTCCTGTAG

Protein sequence:

>DPOGS214989-PA
MSREQVAALVQCVEAVLKIIHVYIVLKYSSLFRIPVNSDEVVKLVNDVNFEDMTNDEAVRVLREVVQKPGPIKLVVAKCWDPNPKGYFTIPRTEPVRPIDPGAWVAHTQALREAYPPPPLSSVPASLPERASDAGSLAEPQLSVGMDMALVVRAMLRPESGLEIRDRMWLKITIPNAFIGADVVDWILQHVAGIVDRRDARKYASHMLKAGFIRHTVNKITFSEQCYYVAGELCADMAALRIRSADQDSLASDTLAPLPNPNIMGPGYMPYAGSYGYQPIPFKYSSCLTSEHTVYGYNREESVLSGSGGSSAGSDHLTTKEPPGPNRENEVKSTSSGSGASAAAEGGGGGTRRSHSHSSGSERANDRPVLFL-