Monarch geneset OGS2.0

DPOGS210629
Transcript | DPOGS210629-TA | 9021 bp
Protein | DPOGS210629-PA | 1372 aa
Genomic position | DPSCF300168 + 534421-548882
RNAseq coverage | 1084x (Rank: top 12%)
Annotation
Heliconius | HMEL007167 | 0.0 | 86.53%
Bombyx | BGIBMGA013580-TA | 0.0 | 80.54%
Drosophila | Apc-PA | 0.0 | 40.69%
EBI UniRef50 | UniRef50_B4JSH4 | 0.0 | 43.64% | GH18063 n=1 Tax=Drosophila grimshawi RepID=B4JSH4_DROGR
NCBI RefSeq | XP_001955080.1 | 0.0 | 43.32% | GF16423 [Drosophila ananassae]
NCBI nr blastp | gi|194745208 | 0.0 | 43.32% | GF16423 [Drosophila ananassae]
NCBI nr blastx | gi|195112796 | 0.0 | 41.34% | GI10524 [Drosophila mojavensis]
Group
Gene Ontology | GO:0005488 | 4.6e-62 | binding
              | GO:0016055 | 3.3e-07 | Wnt receptor signaling pathway
              | GO:0008013 | 6.3e-07 | beta-catenin binding
              | GO:0005515 | 1.9e-06 | protein binding
KEGG pathway | dan:Dana_GF16423 | 0.0
  K02085 (APC) maps to:
    Basal cell carcinoma
    Colorectal cancer
    Pathways in cancer
    Wnt signaling pathway
    Endometrial cancer
    Regulation of actin cytoskeleton
InterPro domain | [459-753] | IPR011989 | 4.6e-62 | Armadillo-like helical
                | [377-743] | IPR016024 | 1.1e-38 | Armadillo-type fold
                | [1145-1168] | IPR009223 | 3.3e-07 | Adenomatous polyposis coli protein, cysteine-rich repeat
                | [1011-1026] | IPR009240 | 6.3e-07 | Adenomatous polyposis coli protein, 15 residue repeat
                | [663-701] | IPR000225 | 1.9e-06 | Armadillo
Orthology group | MCL15417 | Insect specific

Nucleotide sequence:

>DPOGS210629-TA
ATGAATCGAGACTATTATAAAGTGATGTTCACGTACAAGAAGACGAATCGTTCGAAGGCACACAGTCGGGACGTTCTCGACAACGACACGGAATTCAATGCTAACCAGTTCAAACTTTACGACGACTTCTCATCGGAACACAGTTACAAGTTCACTACGCTGAGGAACTTTCATGAGGTCGTGAGGAATGAGAGACAGAGATGCGATCCTAACGATTCAGGCATCGACCTCGACAACGGGAACCCGACGTATGTCACGAACACTGACAGTGATCTGTTGGAGAACGAAGTAATGGTAGATCCTTGGAGCAGTATGGATTCATCGAATTCACCGTTGACTGATTCAGGACCGTCGGCCAGTGAAGAATGCACTCCTTCTACTTCGAGCGAACCTCCTCGAGGGCCTCTGGATGCATCTTCTCCAATATCTCCAGATACACGTCGCTTACCAGATCCAAAAGACACCAGACTCAAATATCCAAGACAAAACTTAAATAAACCACGTACCAGAGATCTAACCCTGGATCTAGAATCTGACCTCTCCCGCCGACCGCATTTTCTTGATGACGATTACCAGCTCCCATCCAAATCAAACATTGAATGCCGTATTCCGAAACCTCATTTCCTCGATCACTCGCCATCACCGGACGAGTTTGATGAACGCAGTGACATAACTAATCCAAATTTCCTTGACGATGAAGTCGATGCTGATGATAATCAAGCACAAAAGAAATCCGGTGCTCTGCCCAAAAGGACAAATAAACCACAGACATCTTACACGTCTCAAGATGAGAGACCGCACAGGACCAATTACAGAAGCACACTTCATTATGGCTTAGAGAGTGCAGCCAGATCGAGCGAACGTGACAAGAGGGCCTCACAACTCTTCAGAGGGACCTGGCCAGGAGAGAGAGACGTCTGGACTGCACACGATTCAGGAAGTGTTAGGAGTTTCTCTAGCAACGGCTCAGATGGACCGAGACCATCATCAAATTTGGAAAATAAAATGGAATGTGTCTGTTCACTTATATCTCTTTTGAGCTTATCCCCAGAAAACAACGCCGATCTTAGCGAGCCACTGTTAGAAATGAGTAGATCAATTGAAAGTTGTATAGCAATGAGGCAAGCTGGATGCATTCCATTACTGGTTCAATTGATCCACTCAAAAGTACCGAGAGAAACGCGAGATCGCGCAGCCAAGGCTCTTCGTAATATAATACACACTCAAAAGGATGACAAGGCCGGCAGAAGGGAAGCGAGGGTCTGGCGACTGCTAGAACAGGTCCGGGAGTATTGTTATGTATTAGAAGGTGTGGTTGAAGCTAGGAAAGAAGGCAAAGAAGCGGTAGAAGATGACGCCACCAAACATCCTAGCCAGAGTGTTGCTGCCTTGATGAAATTGTCCTTTGATGAAGAGCATCGCCACGTAGTTTGTCAATTAGGAGGTTTGCAAGCCTTAGCGGCGCTTGTTAGTGGGGACCAAGCCGCACACGGAAGCAGAACTGACGATAACACATGCATAACGATGAGGAAGTATGCTGGTATGACACTTACAAATTTGACATTTGGGGACGGAAATAATAAGTCGTTGTTGTGCTCATTTAAAGACTTCATGTTGGCGCTGGTTGAACAGCTGGAGTCTCCTAACGATGACATGAGACAGGTCACGGCTGCGGTGCTTAGGAATTTATCTTGGAGGGCTGATACAAATAGTAAACAAGTGCTAAGAGAAGTGGGTGCCGTGAAAGGGTTGACAAAAGCCGCTATGACGTGCCAAAAGGAAGCTACACTCAAGTCTGTTCTCTCAGCACTCTGGAATCTCACAGCTCACTGTTCCATGAACAAAGTGGCTTTGTGTTCTGTTGATGGCGCTTTGGGTTTCCTTGTAGACATGCTAAGTTATAATTCACCAACGAAGACGTTAGCCATCGTGGAGAACGCGGGTGGTATCATGCGGAATGTGTCCAGTCATATCGCGGTCCGAGAAGATTACAGGCAAAT
TCTCCGCGAGAGAAATTGTTTAAGTGTCTTGCTACAACACCTCAAGTCTCCAAGTCTAACTGTAGTCAGCAACTCGTGTGGTACTCTCTGGAATTTATCGGCGAGGTGTCCTCAAGACCAACAGTTCCTGTGGGACCATGGAGCAGTACCGATGCTCAGGAGTCTTATTCATTCGAAACACAAAATGATAGCTATGGGATCCAGCGCTGCTTTGAAAAACCTACTTAATTCTAAACCCGGGAAGACACATATAATATCCCTGGACACCACCGCGAGAAGTATGAATTTGCCCGCTTTACCGACCCTTGTAGCTAGGAAACAAAAGGCACTGGAACAAGAACTAGACCAGAACCTTGCAGAGACCTGTGACAACATAGAACCAGCGACCTCACCGACTACAAGCAATAGAGACGAGAAGAATCTTTTTACAGCCACTGAAAGACAAATAGCAATGAATCTAGAAAGGCACAGGATGACGTCCAGCTCGCCGCTCATGGGAACGCTCTCCTCTCAGATATCTTTAAGTCACAGCGCACATCTCGCTAGTTACTTGAGCTGCAGCAATACCCTCTTAAAGGGAGCGCTGGTCACTCGTTCTACAGGAAGCAGTTCTAACAATGTCAATAGATCCGACAGCAAGGACTCGGTGACGAGTACTCATTCTGATTCCGTATTTGAGAGAGGCGCACGTACCGGAAAGGTACCTGTGCCGACGCCCAGAAATAAGGACCTCATGAAAATGGACACAGGAAGTGATACATTCAAATCTCAAACTTTAACAAAAGATCCTTCGTATTTACCATACACCCAACCACCCATACCACCTCCGAGGAGTTCGACTGACATGAGAACCAACTACACAGAATCCGGTTACGACGTCGACCAGGATTCTTGCGACCAACCGATCGATTTTAGTAGGAAATACTCAGAAACTAAGACTGATACGGAGCCTGTAGAAAAATCAAAGTCTGAGTCCAAAAAGTATACGAAAAAAGATAACCCGAATACCTTTGGAGATTACCAAGAGACTGATCTGGATCAACCCACGGACTACTCGTTACGTTACGCGGAACACCAGTCGGAGACGGGCTCGGATATATCAGAACCGCCCGCGCCTTCCGTCCACGAGGACACTATAAAGCATTTCGCGACAGAGGGCACGCCGTACGAAACGCCGATTATTTTCTCCACAGCTACGTCCATGTCCGATCTTCGCATAATCGGCAAAGACGGGAAGCCCAAGACGTCCAGCGTGAAGGAACATCCCGAAGCCATGACTCACTCAGACGAAAAGGACACCTGCTCCATAGAAGACCCGCCGCGGCATAATAACGACGAAGTCACAACGTCACCGCCACAAGCAGTCCCCGAGCCTATAGCGGAGAAACGAAGCGTCTTTGATACGAAGTTCAGCTCGGGAATGATCAGTCCCGAGAAACCTGTGAACTACTGTGACGAAGGAACCCCGGGGTACTTCTCCAGGGTCAGCTCGCTCAGTAGTCTGGGCGAACCCACACAAGAAGACAGCCTGGCCAAGAAGAACGCTCAGGCATCCATCAATGTTGACCCGAGCCCTCAGGAGCAAAGCACTAGTAGTTCAGGAAAAGATAAAGAAGGATCTAAGGCGGTGACGTTCGCCACGGAGGCCGTGTCGATCGAGTCGTCCACCCCGCCCCGCTTCATAGAGGACACTCCTCTTATGTTCAGTCGGTCCAGCTCGCTGGGATCCCTGCCGGAGTGCTCGCAGCAAGACGACCAGGGCTCGGTGGTGTCGGAGATCAGCCGGTTGACCTCGGGCTGCATCTCCCCCAGCGAGATCCCGGACTCCCCCGGCCAGTCCGTGCCGCTCTCCCCACGGGCCCAGCTGTCTCCAAGGATTCCTCCCATACATCACGTGGAGTTATTGTCGTCTGTGTTCGAGGAGCGGCCGGCACAGTTCGTGGTGGAAAACACTCCGGCACACTTCTCAGCCAACACCAGCCTCTCCAGCCTCACCATCA
ACGACGAACCGAGGGTACACACAACACTTACACTAATATACTACTCTAGGCTCCTTAACAAGGGTTTGTATCGTGTAACTACGAACAGAGAGTTTGCTGTTTTTAATTCATATACCTGACAGTATCAAAGTGAGCAGAGGAAAAACTAGAAACTTTGCGTATGTAGTTTGTTGATGCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTAGCTAAAGATTCCGCCAGCGATATCGTCAAAGACGGCGATTCCGATAGCGACCCCCAGAGCGACGGTGTGCTGCAAGCCTGTATAGAGCGAGGCATACAACAGGTGGTGAAGCACAAGGGACCTCCGTCCGACGGGTCGGCGTCCTACACGGCGAGGAACGATGTTTACAGGTCTCTTCCCGCACACTTGAGGTCTTCCACTGAGACCGTGATGATGTCTAACGACTTCAGGAGCGTTCGTGAGAGTCCTCCGCCGTTGCCGCCCAAGGAGCGTCGCAGCCCCGCCCCGCCACGTCCGCCGCCGCCGGATGACTTCGGTCGAAGACAGACCGAGAGAGCCCAGCGGACTCCAAAGCCAAAGTTACCGGTCACACAAAACTTAGTCAGAAACGATTCTCTCAGTTCACTCAGTTTAGACTCGTTTGGTTCTACCGATAGGGAGATATTCGAAGAGACCGTGAAAGCGGGGCTGTCGAGTGTCAACCCTAGAGCGCAATACACTCTCGATAATTGCAAAGGCAAGGCACACTCGGCAGAGAGACGGACGGAACCGAAACATAACAGACACAACAGGTCACTCGATAGAAGCGACAAGATCACCAACAGGGGTCACAAGGTACGGGACCCGGAGTTCGAGAGAAACATCGCATCTGGGGCTTATAACATTAAGCCGAGCAACAAAGTCAAGAAGCTCGAGCAGGAGTTTGCTAATTTCAATGTGGCGAGATCCTCGTATTCATATCACGGAGAAGGTTCTGGAACTCATCAGTCGAGGTTCAGAGGTCACGGCCGGTATTACGACGATGGACCCCCGAGCCTGCCCACCAGAAGCGACGAACCACGCAGGCTGGATAGAAGCAACTCACTCAGTTCACTCAGCAACGATTCATTTGGTTCCACGGACAAAGAAGTGTTCGAACGTTCAGTACGGATAGGAATGTCGAAGCCACTACCGAAAAAGAAGGAGGAACCTAAACTAAGAGTAAGGAGTGACGATCGACTGGAAGTAAGAGAGTCACATAGGAGACGTCGCCGGGACACCAAGCATCAGGGGGCCTTGGACAAAATGGTCGGCGAGGAATACTCGAAGGATAGGGCGATATTGGAAGAAGTTATAGCGCGAGGCGCCGGCGAGACTCAATCAGCTAATCAGAGCGCTCAGTCAAGTCAGTCAAAAAACGTCGAGGCGCCGTCTGCGGGTCCCGCGGCCGCAGACGACGCACGCACCGCGCGAGACAACATTGGCTCTGATGACTCCGCCATACACACCATGGACGACATACAGATCACCATCAACGACACGCCCATATCCATCTTCGACCGATCTAATGAAATGCCCACCGTCGACTCGGGAAATACAACTCTAGACTCGGACTGCATAGAGATGGATAAATCCAACGAGTGTTTGGCGCCGGCGTCCTGCAACACTACGGTGGAATCTGATAGAGACGAGTTAAATCGATCCAATGAATCGTATGCCGATGTTTTGGATGGCTCCTGGAGCGAAGAC
GAAAAGCACAAGGAGTATGATTCAGGTACATTGACAAGAAAGAGTAAACACAAGCCCGAGTGGACGGATGTGACGTATATGGGGGGTCCGGCGAACTTGGAGTTTGAATCCAAGAATAATGACACTTGGAATGAAAACACGTGTCCGGATGACGTCACTTTCCCTACAATCAGTGGCTCCGTTCATATGGTGTCGTCCATGAAGAGCGAGATCGTGGACACGGCTATGGCTCTCCCAGACTTGCTAGAAAAAAGCGAAGCCCCTAATTTGTTCTCAAAACCAGATTTGACCGACAAACTAGACGGAAAGATATCAGAGATTGACGACGTCCATTTGGATGATAAAGCCCTGATTGAAAATATACCCGAGCCAAGCTTCACGTCGCTGGTGGATGAAGCGGAGCCTAAATTTGATTCTATTATGGCGTTAGCTATTGAAAAGGAGGCTGCAAGACTTGCCGCCCAATTGAAAAGTTCCCAATACGCAATGGAAAATAGCGTTACGTCTTTGACGTCGCTGGATTTGGATAACGTGAAACCGCCATCGAATTTAGGCAGTCTACTCTCCCTCAGTGCGTCGGGCCACTGGGATGAGTCAGCTCAAACATCGAGGAAGTCACAAAAAAGTAGGAAGAAGTCGCTGCCCGTTGCTCTCATGATGAAAAGAGCTCTTAGCAACTCGATGCATCAAGGAAGCTCCGAACATCTAGATAGTATCCCTCTTAGTTTATTAGACAATGTCAAACCTCCGTCGGAAATGGAAAATCTCGATATGGATGGAAGCATGATTTCAGTGTCGAGTATTGTGTCCGAAGTGGCCGAGACTAAAGATAGCAAGACCCCGTTGATATTCGATTACAAACAACCGGTTCAGGACTTCCCACTGTGCTCTACATTTACACACGTCTTCCACGACTTAGACAAAGTGAATCCGCCTTCATTGTTCGATGAAATAGCAGAGTCTACTTTGGAAGCAGACCAGACAACTGCTCACCAGGTGTACGATGATTGTACATCGAACACTCTGAACGTGATCACTGACATACCGTCCGGCTCAGAAACTTGCACGCCCTTACCATCAGATATCAGCAGCGTCGAATCAACACCAAAAAGACAACGTGATCCTAAATATTTAACCCCCAAAGAAAAACGAAGTGCGGCAAAGCACAGATATCAAACTTATACTATCACGGACACCGTATCTAGCAACGATGTTGTACTGAAAGTAGAATCTGAGGAATTTATAACTTGGACGAAATCTGAAAGCGATAAATCAGACGAATATGTAACCGCCAACTCGGAATCAAAATCCAAACGGCGACTGAGCGCTAAGCAACGAAGACAAGAGGACAGAGCTAGGTATCAAACTCAGACCGTCAACATACATAATATGCTTTCTCAGGATTCCTCTCAGGAAAGAGAACAACTGAATCCTCAAATCGAGAGTCTCAAACAACGATTGGCTGCAAAAAAGACTCTCAAACAGAAACGCATCGAAGATGCAGAGAGATTCAGGACCAGGACACTCAGCGAAGACATCCCGCCCTCTCCAACGTTTGTCACAAAAGACGCTAATTTTGAAAACGTTGAAACAACGACCGGTTACGACAGCCTGTCCTCGAATGAATTGAATCACCAACAACTGCTGGACGACCGACACAGCGATGACGTGTTCAGAGACGTGGACAGCGGACACAATGAGGACGACTTCGAATTAAATTCCACTAAAATGAAAACTTACACTAAAAGTTTCAGGAACTATCTGCCCGTCATCGAGAGTCCGGCGTCGGTTGACATGTGCGTCGTTAATAATCTGAAAAACATAGAAATGACCGCCTCCTACCGACGTAACTTACAAAGCGATAAAATTCAGAATCCGAGCAGCAGTGACTTTGCCAGCTTCGAAGGTGACAGCCACTCTGAAGGGAAAGCATTGTCGGAGTCGGATTCGGAAGCACCCACGTCGAGGCCGAAACCGAAAATAATTAAGCCCGAGAGAAG
AGATGGAAGTTTAGATTCCAACGAATCAGGTGACAGGGAACATGAAGCTCCTAAAGTCGTAAGGGGTAGGAAAAAAGCTCTGTATGTATCACCTTACAGACGAAACGTACCAAGCCCCAAGAAACAAGCTACGCCAACAGCTGCTAAAATCCCTCCAAAGTCTGCCCCTACACAGAAACCGACTACCAGTAAAACAGTAGGCGTGAAAGCGCCAACCAAAACAGCTGTGACTTCTAAAGTACTCCCAACTACTTCTCCTAAAAAGACTGCCCTCAATAAATCCCCATCCAAATTAGCTCAAAAACCAACTAACGCACCGTCTTCCAGCAAACCCGCACCTCTAGTCAGACAGGGTACCTTCACCAAAGAAGAAAGCTCAGTACCCGCAAAAGATCTGCCTGTTCCGAGTAAAAAGTCCGTCGGAACTCGGCCCGGCACCGCCACCTCACCCACAAAAACACTGGCCAACACTTCGCCATCGAGGTTGCCGCAATTTAATCGAACCCGTACCTCCACTGCCAAACCAAATACCAAACAGAATTCAGCAAAACGTGCCTCAGAGCCTATAATGAAAAACTCGCCTTCCAACCACAGCTTACAGAGCAACGACAGCGGCAAAACTATAGTATTGGCACGCGGTTCTCGACAAGGAAGCACTTCAAGCGTGAATTCTGTAGCGTCCTCTAAAACAAAAGAAGTGGAAAGCAAAATTGCCAACCTGTGGAAGAAAGTCGAGCAGACGAAGAAATTACCGGCTAAGGCCGATAAGAGAGTTTGGATTGATAGCGACAAGGCCGAAACACCGAAACTCATAAGGAGTTCCACGTTCGAAGGTCAACCGAAACTCAGCCCAGTGTCCACAGCAACCAAGCAAAAGTCCGCGATCGGCATACGAGTGTCGCAGATACCCAGCCTGAGGCCGAAGACGACGCCGGGCAGAGCCGCGAGTCAAACCAATGTCAACAAGAAACCAGCCAACAAAGGATTCTTACGGAAGGGCGGCGGTCAGGTGACGTCATGA

Protein sequence:

>DPOGS210629-PA
MNRDYYKVMFTYKKTNRSKAHSRDVLDNDTEFNANQFKLYDDFSSEHSYKFTTLRNFHEVVRNERQRCDPNDSGIDLDNGNPTYVTNTDSDLLENEVMVDPWSSMDSSNSPLTDSGPSASEECTPSTSSEPPRGPLDASSPISPDTRRLPDPKDTRLKYPRQNLNKPRTRDLTLDLESDLSRRPHFLDDDYQLPSKSNIECRIPKPHFLDHSPSPDEFDERSDITNPNFLDDEVDADDNQAQKKSGALPKRTNKPQTSYTSQDERPHRTNYRSTLHYGLESAARSSERDKRASQLFRGTWPGERDVWTAHDSGSVRSFSSNGSDGPRPSSNLENKMECVCSLISLLSLSPENNADLSEPLLEMSRSIESCIAMRQAGCIPLLVQLIHSKVPRETRDRAAKALRNIIHTQKDDKAGRREARVWRLLEQVREYCYVLEGVVEARKEGKEAVEDDATKHPSQSVAALMKLSFDEEHRHVVCQLGGLQALAALVSGDQAAHGSRTDDNTCITMRKYAGMTLTNLTFGDGNNKSLLCSFKDFMLALVEQLESPNDDMRQVTAAVLRNLSWRADTNSKQVLREVGAVKGLTKAAMTCQKEATLKSVLSALWNLTAHCSMNKVALCSVDGALGFLVDMLSYNSPTKTLAIVENAGGIMRNVSSHIAVREDYRQILRERNCLSVLLQHLKSPSLTVVSNSCGTLWNLSARCPQDQQFLWDHGAVPMLRSLIHSKHKMIAMGSSAALKNLLNSKPGKTHIISLDTTARSMNLPALPTLVARKQKALEQELDQNLAETCDNIEPATSPTTSNRDEKNLFTATERQIAMNLERHRMTSSSPLMGTLSSQISLSHSAHLASYLSCSNTLLKGALVTRSTGSSSNNVNRSDSKDSVTSTHSDSVFERGARTGKVPVPTPRNKDLMKMDTGSDTFKSQTLTKDPSYLPYTQPPIPPPRSSTDMRTNYTESGYDVDQDSCDQPIDFSRKYSETKTDTEPVEKSKSESKKYTKKDNPNTFGDYQETDLDQPTDYSLRYAEHQSETGSDISEPPAPSVHEDTIKHFATEGTPYETPIIFSTATSMSDLRIIGKDGKPKTSSVKEHPEAMTHSDEKDTCSIEDPPRHNNDEVTTSPPQAVPEPIAEKRSVFDTKFSSGMISPEKPVNYCDEGTPGYFSRVSSLSSLGEPTQEDSLAKKNAQASINVDPSPQEQSTSSSGKDKEGSKAVTFATEAVSIESSTPPRFIEDTPLMFSRSSSLGSLPECSQQDDQGSVVSEISRLTSGCISPSEIPDSPGQSVPLSPRAQLSPRIPPIHHVELLSSVFEERPAQFVVENTPAHFSANTSLSSLTINDEPRVHTTLTLIYYSRLLNKGLYRVTTNREFAVFNSYT-
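
The two records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (the record does not state which translation table was used); the helper name translate and the two short prefixes copied from the records are illustrative only. It translates the first 30 nucleotides of DPOGS210629-TA and compares the result with the first 10 residues of DPOGS210629-PA.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed here; the record itself
# does not name a translation table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt):
    """Translate a nucleotide string in frame 0.

    Codons containing anything other than A/C/G/T (for example the N run
    in the transcript) become 'X'; stop codons become '*'. A trailing
    partial codon is ignored.
    """
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(
        CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Prefixes copied from the nucleotide and protein records above.
transcript_start = "ATGAATCGAGACTATTATAAAGTGATGTTC"
protein_start = "MNRDYYKVMF"

print(translate(transcript_start) == protein_start)
```

The same function can be run over the full transcript once the wrapped sequence lines are joined into a single string.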