Monarch geneset OGS2.0

DPOGS201827
TranscriptDPOGS201827-TA2778 bp
ProteinDPOGS201827-PA925 aa
Genomic positionDPSCF300191 - 993275-1010947
RNAseq coverage979x (Rank: top 13%)
Annotation
HeliconiusHMEL0175380.080.09% 
BombyxBGIBMGA008436-TA0.081.13% 
DrosophilaDAAM-PD2e-17061.57% 
EBI UniRef50UniRef50_D4ABM30.046.81%Dishevelled associated activator of morphogenesis 1 (Predicted) n=4 Tax=Muroidea RepID=D4ABM3_RAT
NCBI RefSeqXP_002040330.10.057.28%GM18987 [Drosophila sechellia]
NCBI nr blastpgi|1953475800.057.28%GM18987 [Drosophila sechellia]
NCBI nr blastxgi|1487045980.047.66%dishevelled associated activator of morphogenesis 1 [Mus musculus]
Group
Gene OntologyGO:00037794.8e-124actin binding
GO:00160434.8e-124cellular component organization
GO:00300364.8e-124actin cytoskeleton organization
GO:00054881.6e-90binding
GO:00170481.1e-35Rho GTPase binding
KEGG pathwaydse:Dsec_GM189870.0 
 K04512 (DAAM)maps-> Wnt signaling pathway
InterPro domain[426-891] IPR0031044.8e-124Actin-binding FH2/DRF autoregulatory
[427-802] IPR0154251.4e-105Actin-binding FH2
[7-374] IPR0160241.6e-90Armadillo-type fold
[187-375] IPR0104723.5e-58Diaphanous FH3
[2-184] IPR0104731.1e-35Diaphanous GTPase-binding
Orthology groupMCL11671 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201827-TA
ATGCCACCCCAGGACGAGCTAGACGCCAAGTTTGCTGAACTAGTGGAGGAGCTTGACCTGACAGCCGTCAACAAAGCAGCTATGATGGAACTACCAGCAGCCAAGAAGTGGCAGATCTATTGCAGCAGGAGACCACCGCCAGGTCAAGCGCCACCACTAGCGACGGCGCCGCAAGTAGAAGAATACATCAAGGCCCTGAATGAAATTTCAGACGCGTTCGCGTCATCAGAGAATGCTCCTCCCAGTGAAGCTACCGGTCTAGTAGATGGTTTGAAGACTGCTCTCAGGACGCGGGCCCACAGCTTCGTCCTTCGTTTCATCAAGCAGGGAGGTCTGGCCGCCTTGCTAGAGGCTCTACAGAGAGCTCCTAGAGACGACGCTATCACACGACACAACCTCATAGCCGGCATTAAGGCGCTCATGAACAATTCCACCGGTCGAGCTCATGTGCTGGCTCATCCTACGAGTATAGATCTGATAGCGCAGTCTATGGACACCGAGAACGTTAAGACTAAAGTGGCAGCCCTGGAAATATTGGGAGCTGTCTGTCTAGTACCAGGCGGTCATAAAAAGGTCCTAGAAGCGATGATACATTACCAGAAATACGCTGGCGAGCGTGCCAGGTTCCAGGGTATAGTCAACGAATTGGATAGAAGTACGGGCGCGTATCGAGATGACCTCGGCCTCAAGACGGCTATCATGTCGTTCGTGAACGCTGTACTGAATTACGGGCCGGGAGAAGAAAGTCTTGAATTCAGACTGCATTTGAGATATGAACTGCTGATGTTGGGCATACAACCTGTAATCGAAAAACTGCGGAAATACGAGAACGAGACCCTCGACCGTCACATCGAATTCTTCGAGCTGGTCCGTTGTGAGGATGAAAGGGAACTGGGGAGACGATACGAACACACACACGTCGACACTAAGAGCGCCGCCGCCATGTTCGAGCTGCTGAGGAGGAAGCTCAGCCATACAGCTGCATATGGACACCTGCTGTCACTGCTGCAACATCTGCTACTTCTGCCATTGGAGTATAACCCTCACTCGCAGCACTGGCTACTGCTGGACCGGGTGGTGCAGCAGATAGTGCTACAGGCGCCTGGAAGGGACGGCTCTGATAGCAAGGTGTACAACCCGGATGTAGCTCCACTGGAGATAAATGTTGGAGAGATCGTACAACTCCTGGCGAAGGAAGAGGAACTGGTAGCGGCGAGGAATAAAGCTGACAACTTGGAAAGAGAAAACGTTGATCTCGCCACTGAGTTGGCCAAGAAGAAGAACGTCCCCACGCCCGGGAACCCGCTCAAGAGCTTCAATTGGAGCAAGCTGCCCGACGCCAAACTACACGGCACCATCTGGCAGGAGTTGGACGACACCAAGCTATACAACGCCATGGACCTGAACGCCATCGACAAAATGTTCTGCGCCTACCAGAAGAACGGAGTACAGAACGAGGGTTCAGTGGAAGACCTGCGCCAGCTGGGGTCCAAGCCGCGCTCCAAGATACTATCAGTGATCGACGGACGGAGAGCTCAGAACTGCACCATACTGCTGTCCAAACTCAAGATGACAGATGAAGAGATTTGTCGCGCTATCCTCCGTATGGACAGCGGGGAACAGCTGCCCCTGGATATGCTGGAGCAGTTGCTGAAGTTCACCCCCAGCGCGGAGGAGGCCGCCCTCCTCGAGGAACACCAGGACGAACTCGACTCCATGGCCAGGGCTGATCGATTCCTATACGAGATCTCCAAGATTCCCCATTATTCTCAGCGTGTGAGGACTCTTCTGTTCAAGAAGAAGTTTTCGCTCGCAGTGTCTGAAGCTTCCTCTCGAGCCTCCGTAGTACTGAGAGCTGCGAGGGACATGACACGCTCTAGAAGACTAAGAGCTTTGTTGGAAATTGTACTGGCGCTTGGCAACTACATGAACCGAGGTGCTCGCGGCAACGCGTCCGGTTTCCGTCTGACGTCACTCAACAAGCTAGCCGACACCAAGTCCAGTGTGACAAGGAACACGACACTACTTCATTACCTGGTCGAAATGTTGGAGACTCAGTTCAAGGACGTTCTTCTCCTGGAGGAAGACCTTCCTCACGTCCGCGCCGCTGCTAAAGTATGCGTGGACCAGCTCGAGAAGGACGTCGGCTCTTTGAGGACCGGTCTTCGGGAGGTTTCAAGGGAATTGGACTACCACGCCTCCCTGCTCTCCTCGCAACCCCATGACGCCTTCGTCCCCGTCATGAGAGAATTCCACGCTCATGCTGTGTGCTCCTTCACACAGCTCGAGGATCTCTTCCAGGATATGAAGAGTCGTCTGGAGGCTTGCGCGCACGCGTTCGGTGAGGAGCCCAGCACGTCTCCGGAACAACTGTTCGGAGCCCTGGACTCCTTCCTCACACAGCTGGCGGAGGCGAGAGCCGAGTGTGACGCGGCCAGGAGGAGGAGGGACGAGGAGGAGAGGAGGACCAGGCACGAGCAGGAGCTCAAAAAACGGACGATGGAAAGGAAACAGGGCTCCAGTTTACTGGGGTCGGTCGGGAAGTCGCTCGGGAAGACCAACGGGGACTGCAACGGACATGACGCGTCCAGAGACGGGACCTTGACCAACGGACAGAAAGGAGAGTTCGATGACCTCATATCCGCCCTGAGGACGGGGGACGTGTTCGGAGATGACGTAGCCAAGTTCAAGAGGTCCAGGAAAGCCAAGGCCAGGGGCAGAGACTCACCGCCTCGACCAGTCTGCCGAGAAGACTCCAGGGAGAGGCAGAAGAATTGA

Protein sequence:

>DPOGS201827-PA
MPPQDELDAKFAELVEELDLTAVNKAAMMELPAAKKWQIYCSRRPPPGQAPPLATAPQVEEYIKALNEISDAFASSENAPPSEATGLVDGLKTALRTRAHSFVLRFIKQGGLAALLEALQRAPRDDAITRHNLIAGIKALMNNSTGRAHVLAHPTSIDLIAQSMDTENVKTKVAALEILGAVCLVPGGHKKVLEAMIHYQKYAGERARFQGIVNELDRSTGAYRDDLGLKTAIMSFVNAVLNYGPGEESLEFRLHLRYELLMLGIQPVIEKLRKYENETLDRHIEFFELVRCEDERELGRRYEHTHVDTKSAAAMFELLRRKLSHTAAYGHLLSLLQHLLLLPLEYNPHSQHWLLLDRVVQQIVLQAPGRDGSDSKVYNPDVAPLEINVGEIVQLLAKEEELVAARNKADNLERENVDLATELAKKKNVPTPGNPLKSFNWSKLPDAKLHGTIWQELDDTKLYNAMDLNAIDKMFCAYQKNGVQNEGSVEDLRQLGSKPRSKILSVIDGRRAQNCTILLSKLKMTDEEICRAILRMDSGEQLPLDMLEQLLKFTPSAEEAALLEEHQDELDSMARADRFLYEISKIPHYSQRVRTLLFKKKFSLAVSEASSRASVVLRAARDMTRSRRLRALLEIVLALGNYMNRGARGNASGFRLTSLNKLADTKSSVTRNTTLLHYLVEMLETQFKDVLLLEEDLPHVRAAAKVCVDQLEKDVGSLRTGLREVSRELDYHASLLSSQPHDAFVPVMREFHAHAVCSFTQLEDLFQDMKSRLEACAHAFGEEPSTSPEQLFGALDSFLTQLAEARAECDAARRRRDEEERRTRHEQELKKRTMERKQGSSLLGSVGKSLGKTNGDCNGHDASRDGTLTNGQKGEFDDLISALRTGDVFGDDVAKFKRSRKAKARGRDSPPRPVCREDSRERQKN-