Monarch geneset OGS2.0

DPOGS203186
TranscriptDPOGS203186-TA3255 bp
ProteinDPOGS203186-PA1084 aa
Genomic positionDPSCF300035 - 93206-107364
RNAseq coverage133x (Rank: top 56%)
Annotation
HeliconiusHMEL0221570.087.47% 
BombyxBGIBMGA011175-TA3e-17084.08% 
Drosophiladia-PD4e-16158.80% 
EBI UniRef50UniRef50_E0VPZ10.064.24%Diaphanous, putative n=1 Tax=Pediculus humanus corporis RepID=E0VPZ1_PEDHC
NCBI RefSeqXP_395654.30.050.34%PREDICTED: similar to Protein diaphanous [Apis mellifera]
NCBI nr blastpgi|3287915650.050.19%PREDICTED: hypothetical protein LOC412191 [Apis mellifera]
NCBI nr blastxgi|2420150370.054.71%diaphanous, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00037799.7e-136actin binding
GO:00160439.7e-136cellular component organization
GO:00300369.7e-136actin cytoskeleton organization
GO:00054881.2e-64binding
GO:00170481.5e-38Rho GTPase binding
KEGG pathwaymdo:1000117441e-158 
 K05745 (DIAPH3, DRF3)maps-> Regulation of actin cytoskeleton
InterPro domain[611-1051] IPR0031049.7e-136Actin-binding FH2/DRF autoregulatory
[575-1004] IPR0154251.1e-97Actin-binding FH2
[164-452] IPR0160241.2e-64Armadillo-type fold
[159-334] IPR0104731.5e-38Diaphanous GTPase-binding
[385-467] IPR0104722.4e-21Diaphanous FH3
[1034-1048] IPR0104658.4e-07DRF autoregulatory
[232-345] IPR0119895.7e-06Armadillo-like helical
Orthology groupMCL14540 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203186-TA
ATGCGAGCCAGCACCAACGCATCTCTCCAGCAGCAGGCCGTCGCCCGCGCACGCGAGCTCATGTCGTCATTCCGACCGTGCCTCGGGCACTCGGGCTATGCGACTAGTATGTTTATTAGCGCCCAAAAACACATCACGCTGCCAAAAAATAAGATCACACTCAAAAATATCGTGAAATTGTCCGAGCGGACGAAACCGCGGGCGCAGCTAGCTGACAATGGATTTGATCCAATCTACAAAAGTGTTTCCATTGTACTAAATTTGGACGTGACGAGCATTGTACTGTACTATATTACTGTCGACAAAAAAGGTGGTGTTGTGTGGCTGACGTGGAAGAAGGAGCTGGACACCTGGTTCGGAAGGCCGAAGAAGACTGGAGGTGGTATCGACCGTGGATACGACACCGTGCCCCGGGCGGAACTCAGGCCTGATGATGAGGAACCGGCCACGGAGTACACCGCCAAGATAGACGCCCTCGACGACGACCAGCTGGAGAGACGGTTCGAAGAAATGCTGACTGACATGAACATAGGAGAGAAGAAGAAGGAGCCGCTGAGAAAATACTCCAGAGACCAGAAGAAAAAGATGCTGGTGGCTTACAAGTTCGTTAACGCCCAGGAGGGTAGGTCGAAATTCGAGAAGCCCGCGGACTACGTGTCGTATCTGAACCAGCCAGAGCTATCGGTGGGTAAGCTGCACAGTTGCTTGGAGAACCTGAGGATATCCCTGACGAACAATCCCCTGTCATGGATCGAGGAGTTCGGGCCGAAGGGTATCGAGAGTCTGCTGACTACCCTCAATGTCTGCTATACTAACGATTCCCGTTACGATCGCGTCCAGTACGAGTGTATACGCTGTCTCTCCGCTATCCTGAACAACACGGTCGGTATCAGAGCGGTGTTCGACTGTCGGGAGGCTCTTCCCGTACTGGCCAGGAGTTTGGATGCCAGGAAACCACACTGTGCGCTGGAAGCTGCTAAGGTGTTGGCAGCGATATGTCTCATACCGAACGGTCATGAGAAGGTTCTGGAAGCTATCACAATGGCCGGGGAGTCCAGCCGCAGACCGAGACTGCTCCCCATCATCGAGGGTCTGTCTCCGAAGGCCCCTGAGAGTCTCAAGAACGGCTGCATCAGTATAATACTAACTTGGGTCGACTTCCACGATGTCAACGAATGCTTCGAGCTGGTTAAGAACCTTGTTGTGGAGACGCCAGCGGAGCCCTACCTGCTGTCCATACTGCAACACCTGCTCTTCATCAGGGACGATGAGCTCATCAGGCCGGCGTACTACAAGCTGATAGAGGAATGTGTCACGCAAATAGTCTTACACAAAAACGGATACGACCCCGACTTCCGGTTGACGCAGCGGTTTAACATAGATGTGCAGCCGCTGATAGAGGGACTCATAGAGAAATCGAGGGCGGAAGAAGAGAGGAAGGTGGAAGAGCTGAAGAGTAAGCTGGAGGCGGCGATAGCTGCCAGGCAGGAGGCGGAGGCCAGGGTCGCGCACCTGGAGCAGAGACTGAAGACGGCGCCGCCCAGCGGCCCGGGGGGAGTGACCCAGGGGAATATAGCCGCTATAGCTAAGGCGATAGGCAGCCCCGGCGGGCCGCCGCCCCCTCCCCCGCCGCCGATGCCAGGTGGGGGTCCTCCTCCCCCTCCGCCTCCACCCATGCCGGGCGCTGGGGCACCTCCCCCTCCCCCGCCCCCGATACCCGGAGGCCCGCCGCCGCCGCCCATGCCGGGGGGACCCAGGCCTCCACCGCCGCCCGGGATGCCTTCCGCCCCAAGGATGCCTCAACCGGATGTACTCCCTCACGGTCTGAAGCCCAAAAAGAAGTGGGAGGTCGAGGGACCCCTGAAGAGAGCGAATTGGAAAACCATAGTCCCCCAGAAGATGTCCGAGAAAGCTTTCTGGTTAAAGGTCCAAGAAGATAAGTTGGCTTCACCGGATATACTGACGGGATTAGCGCAGAAGTTCTCCAGCAAACCGATGGCTAAGAAGAACGAGGATAACGTCGACAGGGCCCACACCCTCAAAAAGGCGAAGGACCTCAAAGTGCTGGACAGTAAAGCGGCACAGAACCTGTCGATACTTCTGGGGGGCTCCCTGAAACACCTGTCGTACGAACACATCAAGACCTGCATACTGAGATGCGACACCACAGTACTTAATGCCAACGTACTGGATCTCCTGATACAGTACCTGCCGCCGGCGGACCAGCTCCGCAAGCTGTCCGATCTGCGGTGCTCCAGCGACGAGCTGACGGAGGCGGAGCAGTTCGCGGCCGTGGTCTCCGACGTGAAGAGACTCGCCCCCCGGCTCAGGAGTCTGGCCTTCAGGGAGCACTACCAAGAGATCGTCTCGGAGTTGAAGCCGGACATAGTGTCGGGTACAGCCGCGTGCGAGGAGGTCCGCTCCAGCGTGAAGTTCGCTCGTATCCTGGAACTGCTGCTGCTCCTGGGCAACTACATGAACACGGGCTCCAACAACGCCGGCGCCTACGGCTTCGAGATCAGCTTCATCACTAAGTTGACGGCGACGAAGGACTTGGAGAACAAGCAAACCCTGCTGCATTACCTCGTGAACACCATAGAGACCAAGTTCCCAGAAGTACTCAACTTCGCTGAGGAGATGCCGCACATTGATAGGGCCGCGAGGGTCTCGCCGGAGAATCTACAGAAGGCGCTGAAAAAGATGGAGAACGACATCCGCTCGCTAGAGACGGACCTCAACAACTCCAGGGTTCCGCAGTGCGCTGACGACCTGTTCCACGAGACCATGAGCAACTTCGCGAAGGAAGCCCGCGAGCAGTGCGACCTGCTGCACTCCATGTTCAAGAAGATGGAGTCGCTGTACGCCGAGCTGGCGGAGTACTACGTGTTCGACCCGGCCAAGTACACCCTGGAGGAGTTCTTCGCCGATGTCAAGACCTTCAAGGATTCCTTCGCGACGGCCCACCAGGAGAACGTTATAGCGCGAGAGACCGAGGAGAGAGCGAGGAGGGCCAGGGACGCGCGGGCGGCGGCGGAGAGGGACAGGAGGGACCGGCAGATGAGATACAAACAGTTCGTGGACATGGAGAGGGCGCAGGACGGGGTCATGGACAGCCTGATGGAGGCGCTGCAGAGCGGCTCGGCCTTCAGTCGCGAGAGACCGAGGAAGAAAGCCAATCCCAGAGTCGCCGGAGAGGATAGCGACGAGGAGCGCGAGCTCGTGAGGGCGATATTGAGTCGTATCGAAGGTAAATAA

Protein sequence:

>DPOGS203186-PA
MRASTNASLQQQAVARARELMSSFRPCLGHSGYATSMFISAQKHITLPKNKITLKNIVKLSERTKPRAQLADNGFDPIYKSVSIVLNLDVTSIVLYYITVDKKGGVVWLTWKKELDTWFGRPKKTGGGIDRGYDTVPRAELRPDDEEPATEYTAKIDALDDDQLERRFEEMLTDMNIGEKKKEPLRKYSRDQKKKMLVAYKFVNAQEGRSKFEKPADYVSYLNQPELSVGKLHSCLENLRISLTNNPLSWIEEFGPKGIESLLTTLNVCYTNDSRYDRVQYECIRCLSAILNNTVGIRAVFDCREALPVLARSLDARKPHCALEAAKVLAAICLIPNGHEKVLEAITMAGESSRRPRLLPIIEGLSPKAPESLKNGCISIILTWVDFHDVNECFELVKNLVVETPAEPYLLSILQHLLFIRDDELIRPAYYKLIEECVTQIVLHKNGYDPDFRLTQRFNIDVQPLIEGLIEKSRAEEERKVEELKSKLEAAIAARQEAEARVAHLEQRLKTAPPSGPGGVTQGNIAAIAKAIGSPGGPPPPPPPPMPGGGPPPPPPPPMPGAGAPPPPPPPIPGGPPPPPMPGGPRPPPPPGMPSAPRMPQPDVLPHGLKPKKKWEVEGPLKRANWKTIVPQKMSEKAFWLKVQEDKLASPDILTGLAQKFSSKPMAKKNEDNVDRAHTLKKAKDLKVLDSKAAQNLSILLGGSLKHLSYEHIKTCILRCDTTVLNANVLDLLIQYLPPADQLRKLSDLRCSSDELTEAEQFAAVVSDVKRLAPRLRSLAFREHYQEIVSELKPDIVSGTAACEEVRSSVKFARILELLLLLGNYMNTGSNNAGAYGFEISFITKLTATKDLENKQTLLHYLVNTIETKFPEVLNFAEEMPHIDRAARVSPENLQKALKKMENDIRSLETDLNNSRVPQCADDLFHETMSNFAKEAREQCDLLHSMFKKMESLYAELAEYYVFDPAKYTLEEFFADVKTFKDSFATAHQENVIARETEERARRARDARAAAERDRRDRQMRYKQFVDMERAQDGVMDSLMEALQSGSAFSRERPRKKANPRVAGEDSDEERELVRAILSRIEGK-