Monarch geneset OGS2.0

DPOGS210820
Transcript: DPOGS210820-TA (2100 bp)
Protein: DPOGS210820-PA (699 aa)
Genomic position: DPSCF300027 - 526514-533744
RNAseq coverage: 353x (Rank: top 33%)
Annotation
Source          Hit                     E-value  Identity  Description
Heliconius      HMEL008519              0.0      57.67%
Bombyx          BGIBMGA007135-TA        0.0      71.48%
Drosophila      Axn-PA                  2e-59    38.78%
EBI UniRef50    UniRef50_UPI0002063F84  4e-106   35.64%    UPI0002063F84 related cluster n=3 Tax=unknown RepID=UPI0002063F84
NCBI RefSeq     XP_001656733.1          2e-95    35.80%    axis inhibition protein, axin [Aedes aegypti]
NCBI nr blastp  gi|380029587            7e-107   35.89%    PREDICTED: axin-1-like [Apis florea]
NCBI nr blastx  gi|380029587            1e-107   35.77%    PREDICTED: axin-1-like [Apis florea]
Group
Gene Ontology
  GO:0007275  1.9e-24  multicellular organismal development
  GO:0005622  1.9e-24  intracellular
  GO:0004871  1.9e-24  signal transducer activity
KEGG pathway
  aag:AaeL_AAEL003388  5e-95
  K02157 (AXIN1) maps to:
    Basal cell carcinoma
    Colorectal cancer
    Pathways in cancer
    Wnt signaling pathway
    Endometrial cancer
InterPro domain
  [618-697]  IPR001158  1.9e-24  DIX
  [97-225]   IPR016137  1.5e-19  Regulator of G protein signalling superfamily
  [109-222]  IPR000342  2.4e-15  Regulator of G protein signalling
  [449-487]  IPR014936  1e-10    Axin beta-catenin binding
  [209-225]  IPR024066  2e-08    Regulator of G-protein signaling, domain 1
Orthology group: MCL12897 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS210820-TA
ATGTTATTAAGTGTCGTTGTCCAGGTCGTGTCTCGCCATGCAGCGCGCGGGTGGGAGCAGGGCCTTTGCGCCATGAGCCATACTCCTGTAGGCGGCCACCCGCAAGGTTGGGAACACAAGCTTGCTGACAGGTCGTCGCTGCCGCCGGCGCCAGGGGAGGAGAAGAGACAATCTCAGACCAGACATGTGTTCACACACGCACATCTCACCAAAGCGGCCCCGTGTGTGGGCGGCGTTGCGTCGCGGCGCTCGGAGACGGAAGGCTCGTCCGGCAGCTCGGGACGATCCCCGGAGGAACCGCCCTACGCCAGGTGGGCGAGGACACTGCATCATCTGCTCGAGGATGGAGAGGGCGTGCGTCTGTTCCGCAAGTTCGTGTGCGGCGCGGGCGGGCTGCACGTGGACCGCCTCAACTTCTACTTCGCCGTTCAGGGCCTGCGCCAGGAGACCGAGCCCAGCAAGATACGGACCGTCGTCTCCGCCATATACAAGTTCCTCCGCAAGTCTCAGCTAGCGATGCCCGAGGAGCTGAAGCAGCGCGTCAAGCAGAGCCTCAAGGACGGCTCCAACATAGAGAAGACCATCTTCGATAATATGGAACAGGAGGTGACCCGCGCCATCACTGAGTCTACGTACCAGTCGTTCCTGCGGTCGGAGGCCTACGTGTCGTACGTGAGTGCGGCCACTCAGCCGCTGTCCTCGCCTGACGCCTCACCGACACACTCCAGAGAACTATGTGTGGGCACTCTGGCCACTTTACACGAGGGCCAGGAGTTATCAGGCGGCGCCTGTCCGTCCGTGGGCGCCAGGCTCACCCACGACGCTCTGCTCGCCACACAATCCCGACGACTACAGTCAGACGTCGCTCCGCACCGCAAGCGGTCCGTGTACAGCGCGCACGTGTCGTACGCGGGGTACACGCCCGCCTCGCGCCAGGACTCGGAGCGGGCCAGCCTCAGCAGCGGGCGGACGGACAGCGACGCGGTGTCTCTCTCCGGCAGCAGTCTTGACGGCATGTCCCTCCGCGGGTCCCGTGAAGCCCGCGAGTCCCGCCACCGGCCGCGGCTGTACGGCCTCGACCGACACGCCGTCATCAACAAGGAACAAGACACCGCCATGATGATCCCTCGCACGCAGCGTGTGCAGTCGGAGCAGCTCCGAGTGTTGCCGCCGCACGAGTTCGCACCGCTACTGATAGAGAAGCTGGAGCGAGTTAGGAGAGATCAGGACAACAAAGAGAGACTGGAGAGGAGACTCGCTGAGGGCGAAGGCGACGAGCTGTGCGCACAGGCTCTACCGCCACAGCTGGTGGCCGCCGCCATCAGGGAGAAGCTACAGCTGGAGGACGACAACGATCAGGATATACTGGATCAGCACGTGTCTCGTGTGTGGTCAGAGCGCACGCCCGACACGTCCCCGCCGGGAGGGAGGCGCACTCGCGGCCGCCACGGGCCTCACGGCCACGGGTCGCGCCGGGCGGCCTCGGCCCTGTCCGCCGACTCGGGACACTATGACGCGCCCCCGGACTCCCTACACCATCCCCACTCCTTGATACGCAGATCTTTCTCGAAGAAGACGGTGACGGAGCTGACGGACAGCGGGGTGTCGGTGGTGAGCGAGGGGGCGGCCAGTGTGGAGCCGCGCCTGCTGCTGTGGATAGCGGAGGGCTCGGAGAGGGCGGAGAGGCGGCTCAGGGACCTGTCCTCTCGGGGCTCCTCCGCCGACAGGGAAGACCACCGCCGCCGGGACAAGACACAGGCGCGGACACGTACTGGCGCCACTACCGGCAGTACGGGCAGCAAGTCCAGCGGCAGCGGTACAGCTACAGGCGCCACGAGTGGAGCTGGCACCGAGCACACCGTGGTCGTGGTTAACTTCCTGGACGAGAGCGTCCCTTACAGATTCAAGGTGCCCGCCTCGCCGCTCACCCTGCGCACGTTTAAGGAATATCTGCCCAGGAAGGGAAACTATAGATACTTCTTCAAGACGGAGTGCGCGGA
CCTCGACAACACGGTCATACAGGAGGAGGTGAGCAGCGACGGAGACACGCTGCCCATGTACGAGGGGAAGGTCATGGCCAGGGTCAAGAGCATCGAGTGA

Protein sequence:

>DPOGS210820-PA
MLLSVVVQVVSRHAARGWEQGLCAMSHTPVGGHPQGWEHKLADRSSLPPAPGEEKRQSQTRHVFTHAHLTKAAPCVGGVASRRSETEGSSGSSGRSPEEPPYARWARTLHHLLEDGEGVRLFRKFVCGAGGLHVDRLNFYFAVQGLRQETEPSKIRTVVSAIYKFLRKSQLAMPEELKQRVKQSLKDGSNIEKTIFDNMEQEVTRAITESTYQSFLRSEAYVSYVSAATQPLSSPDASPTHSRELCVGTLATLHEGQELSGGACPSVGARLTHDALLATQSRRLQSDVAPHRKRSVYSAHVSYAGYTPASRQDSERASLSSGRTDSDAVSLSGSSLDGMSLRGSREARESRHRPRLYGLDRHAVINKEQDTAMMIPRTQRVQSEQLRVLPPHEFAPLLIEKLERVRRDQDNKERLERRLAEGEGDELCAQALPPQLVAAAIREKLQLEDDNDQDILDQHVSRVWSERTPDTSPPGGRRTRGRHGPHGHGSRRAASALSADSGHYDAPPDSLHHPHSLIRRSFSKKTVTELTDSGVSVVSEGAASVEPRLLLWIAEGSERAERRLRDLSSRGSSADREDHRRRDKTQARTRTGATTGSTGSKSSGSGTATGATSGAGTEHTVVVVNFLDESVPYRFKVPASPLTLRTFKEYLPRKGNYRYFFKTECADLDNTVIQEEVSSDGDTLPMYEGKVMARVKSIE-
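
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein lengths listed above should agree: 699 residues plus one stop codon give 3 x (699 + 1) = 2100 bp. The Python sketch below shows one way to run that check and to translate a coding sequence whose FASTA lines are wrapped, as the nucleotide record is. It assumes the standard genetic code (NCBI translation table 1) and writes the stop codon as "-", matching the trailing "-" in the protein record. The function names read_fasta, translate and expected_cds_length are hypothetical helpers introduced here, and the short "toy" record is only the first eight codons of DPOGS210820-TA with a TGA stop appended.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def read_fasta(text):
    """Return {header: sequence}, joining sequences wrapped over several lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length):
    """CDS length in bp for a protein of the given length plus one stop codon."""
    return 3 * (protein_length + 1)


# Toy record: first eight codons of the transcript plus a stop, wrapped on two lines.
toy = ">toy\nATGTTATTAAGT\nGTCGTTGTCCAGTGA\n"
print(translate(read_fasta(toy)["toy"]))  # MLLSVVVQ-
print(expected_cds_length(699))           # 2100
```

Applied to the full records, translate(read_fasta(...)["DPOGS210820-TA"]) would be compared against the DPOGS210820-PA sequence, including its trailing "-".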