Monarch geneset OGS2.0

DPOGS204620
TranscriptDPOGS204620-TA1491 bp
ProteinDPOGS204620-PA496 aa
Genomic positionDPSCF300432 + 19468-25193
RNAseq coverage127x (Rank: top 57%)
Annotation
HeliconiusHMEL0101572e-10758.40% 
BombyxBGIBMGA012178-TA2e-12952.55% 
DrosophilaSu(fu)-PA6e-8038.96% 
EBI UniRef50UniRef50_UPI00015B5CC66e-8841.61%UPI00015B5CC6 related cluster n=1 Tax=unknown RepID=UPI00015B5CC6
NCBI RefSeqXP_975412.22e-9644.12%PREDICTED: similar to suppressor of fused [Tribolium castaneum]
NCBI nr blastpgi|1892346384e-9544.12%PREDICTED: similar to suppressor of fused [Tribolium castaneum]
NCBI nr blastxgi|1892346381e-9443.25%PREDICTED: similar to suppressor of fused [Tribolium castaneum]
Group
KEGG pathwaytca:6643126e-96 
 K06229 (SUFU)maps-> Basal cell carcinoma
    Pathways in cancer
    Hedgehog signaling pathway
InterPro domain[53-497] IPR0077686.1e-109Suppressor of fused-like
[90-236] IPR0209411.7e-25Suppressor of fused domain
Orthology groupMCL12985 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204620-TA
ATGTCCGGACCGAGTTCTAATGCGATGCCGGCTGGTGTTGTTAGGAATAGTGCGTTGTCATTCGTTGCGTCTGCGCAGGCCACAGACGGATCAGACGTAGCAGTTCCGGTCAACCCAGCTGCTTTGGCTGTAGAGGGAGAACAGATGACTCAAACAGAGCGTTTGATGCCTCCAGGCCTGACTCCTCTTTATGAAGCCTGTACCAAAGTGTACCCTGACCAGCCCAACCCACTTCAAGTGACCACAAGACTTAAATATTGGTTGGGAGGCCAGGATCCTCTTGACTACATCAGCATGTACTGGAACCCCGGGAGACCTGATGAGAACATACCACCACATTGGCATTATGTTAGCTTCGGTCTATCAGACCTTCACGGTGATGGCCGAGTTCATCCAGAGCCACACGGTTGTTCTGGTTACGGCCTAGAGCTGACCTTTCGTCTGGCCAGTGAGGACAAGCAGCCTCCACTCTGGCCCGCCGCTCTCCTACAGGCTCTTGCTAGATATATATTTACTACCGGTAATAAGTTCGTGTGTGGCGACCACGTGTCCTGGCACGCGCCTCTGGACGGCTCCTCCTCGAGGGTCCGTCACCTGTTAGTAGCGACCGACCCTCAGTTAGAGACCACCAACACATGCCACGGCAGCGTCACGTTCTTACAGATGGTGGGTTGCACTAGTCGTGAGCTGCGAGCGGCGCAGAGAGGTTCAGGGTTTGAAGTGCTGAACATGATCAGTGAAGATCCGAGGTGTGGAGGCGCTTGGCTGGTGACTCGCACCCAGAGACGCGTCTCAGCTCGCATCCAGCACCAGAACGTGCAGAAGCCAGCTCACCTGGCCGGAGTGTCCGCTAGGGTCGCCTGGAGGGAATACAGCGTTAGTGTTTGTGTATATGATTGTACGAAGCAGGTGCCAAGGATCAGTCATAAGATGTCCACCGATAGTTTCAAGATGTCTTCCATAGAGAGAGCTCTGCCACAGTTACCGGAAATGATGCAGTGGGAGGGTTCCAGTACACGGCTAACTAGTTCGCCCACTCGCCACGAGATGGCGCTGCCGCTGAACGTCGAGAAGGGTTCCTGGCGCGGAGACGAAGTGGAGTATCTGAATGGAGTCCACTTACTCCTGAACGCGGAAGCCGCCAGCCTGCTACCGCTAGCTATAGACGGTCGAGTGCTCCACGATCGTCACTTCACGTGGCGTCAGGGTCCCCGGACGGTGACGCTCCTGACCCCCGCCGTTGGAGGAGCCTTCGTTACCAGGGCGAAGCCCTACGCCTCCAAAGGACCTTGGCTGCAGATCCTGATTCCACCGGAGCTGGCTGGTGATATGTCCAAGCAAGTGTCCGGCCTGGCCAGGCTGTCAGACAGTGACTCGGAGAGTGACGAGGACAGTACGGAGACCTCACCCAGACCGAGCATACCTATAACACTCACATGGCCTCACTATAGACTGAGCATCTCTGTACTACATGACCTAGAAATATTATAA

Protein sequence:

>DPOGS204620-PA
MSGPSSNAMPAGVVRNSALSFVASAQATDGSDVAVPVNPAALAVEGEQMTQTERLMPPGLTPLYEACTKVYPDQPNPLQVTTRLKYWLGGQDPLDYISMYWNPGRPDENIPPHWHYVSFGLSDLHGDGRVHPEPHGCSGYGLELTFRLASEDKQPPLWPAALLQALARYIFTTGNKFVCGDHVSWHAPLDGSSSRVRHLLVATDPQLETTNTCHGSVTFLQMVGCTSRELRAAQRGSGFEVLNMISEDPRCGGAWLVTRTQRRVSARIQHQNVQKPAHLAGVSARVAWREYSVSVCVYDCTKQVPRISHKMSTDSFKMSSIERALPQLPEMMQWEGSSTRLTSSPTRHEMALPLNVEKGSWRGDEVEYLNGVHLLLNAEAASLLPLAIDGRVLHDRHFTWRQGPRTVTLLTPAVGGAFVTRAKPYASKGPWLQILIPPELAGDMSKQVSGLARLSDSDSESDEDSTETSPRPSIPITLTWPHYRLSISVLHDLEIL-