Monarch geneset OGS2.0

DPOGS203185
TranscriptDPOGS203185-TA3390 bp
ProteinDPOGS203185-PA1129 aa
Genomic positionDPSCF300035 - 107823-125236
RNAseq coverage553x (Rank: top 23%)
Annotation
HeliconiusHMEL0221571e-17576.24% 
BombyxBGIBMGA011175-TA2e-13671.73% 
Drosophiladia-PD1e-14050.09% 
EBI UniRef50UniRef50_E0VPZ13e-15754.65%Diaphanous, putative n=1 Tax=Pediculus humanus corporis RepID=E0VPZ1_PEDHC
NCBI RefSeqXP_002428185.15e-15854.65%diaphanous, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3504193290.048.73%PREDICTED: hypothetical protein LOC100741633 [Bombus impatiens]
NCBI nr blastxgi|3838651420.051.40%PREDICTED: uncharacterized protein LOC100883678 [Megachile rotundata]
Group
Gene OntologyGO:00037795.7e-95actin binding
GO:00160435.7e-95cellular component organization
GO:00300365.7e-95actin cytoskeleton organization
GO:00054883.6e-90binding
GO:00170481.6e-38Rho GTPase binding
KEGG pathwaycfa:4854744e-145 
 K05745 (DIAPH3, DRF3)maps-> Regulation of actin cytoskeleton
InterPro domain[620-1065] IPR0031045.7e-95Actin-binding FH2/DRF autoregulatory
[69-421] IPR0160243.6e-90Armadillo-type fold
[541-1018] IPR0154251.5e-72Actin-binding FH2
[242-436] IPR0104724.9e-51Diaphanous FH3
[64-239] IPR0104731.6e-38Diaphanous GTPase-binding
[137-320] IPR0119893.4e-07Armadillo-like helical
[1048-1062] IPR0104658.8e-07DRF autoregulatory
Orthology groupMCL14540 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203185-TA
ATGTACAACAGCATGCCGAGACTGAAGAGAAGGGAGACGTGGAAGAGGAGAAGCGGGCTGGACACCTGGTTCGGAAGGCCGAAGAAGACTGGAGGTGGTATCGACCGTGGATACGACACCGTGCCCCGGGCGGAACTCAGGCCCGATGATGAGGAACCGGCCACGGAGTACACCGCCAAGATAGACGCCCTCGACGACGACCAGCTGGAGAGACGGTTCGAAGAAATGCTGACTGACATGAACATAGGAGAGAAGAAGAAGGAGCCGCTGAGAAAATACTCCAGAGACCAGAAGAAAAAGATGCTGGTGGCTTACAAGTTCGTTAACGCCCAGGAGGGTAGGTCGAAATTCGAGAAGCCCGCGGACTACGTGTCGTATCTGAACCAGCCAGAGCTATCGGTGGGTAAGCTGCACAGTTGCTTGGAGAACCTGAGGATATCCCTGACGAACAATCCCCTGTCATGGATCGAGGAGTTCGGGCCGAAGGGTATCGAGAGTCTGCTGACTACCCTCAATGTCTGCTATACTAACGATTCCCGTTACGATCGCGTCCAGTACGAGTGTATACGCTGTCTCTCCGCTATCCTGAACAACACGGTCGGTATCAGAGCGGTGTTCGACTGTCGGGAGGCTCTTCCCGTACTGGCCAGGAGTTTGGATGCCAGGAAACCACACTGCGCGCTGGAGGCTGCTAAGGTGTTGGCAGCGATATGTCTCATACCGAACGGTCATGAGAAGGTTCTGGAAGCTATCACAATGGCCGGGGAGTCCAGCCGCAGACCGAGACTGCTGCCCATCATCGAGGGTCTGTCTCCGAAGGCCCCTGAGAGTCTCAAGAACGGCTGCATGCAGCTCATGAACGCTATCATAACTGAACCGGAAGAGCTGGAGTTCAGGATGCATCTGAGGAGCGAGTTCATGAGAACCGGCCTTTATGACCTGATGGACTCCCTCCAGGCGGGCGCGGAGGACGGCGGCCGGCTTATCCAGATGAACGTGTTCGACACCCACGCTGCAGCGGACCAAGAGGAGTTCCTCGCCAAGTTTGATGATGTGAGGGTCGACTTCCACGATGTCAACGAATGCTTCGAGCTGGTTAAGAACCTTGTTGTGGAGACGCCAGCGGAGCCCTACCTGCTGTCCATACTGCAACACCTGCTCTTCATCAGGGACGATGAGCTCATCAGGCCGGCGTACTACAAGCTGATAGAGGAATGTGTCACGCAAATAGTCTTACACAAAAACGGATACGACCCCGACTTCCGGTTGACGCAGCGGTTTAACATAGATGTGCAGCCGCTGATAGAGGGACTCATAGAGAAATCGAGGGCTGAAGAAGAGAGGAAGGTGGAAGAGCTGAAGAGTAAGCTGGAGGCGGCGATAGCTGCCAGGCAGGAGGCGGAGGCCAGGGTCGCGCACCTGGAGCAGAGACTGAAGACGGCGCCGCCCAGCGGCCCGGGGGGAGTGACCCAGGGGAATATAGCCGCTATAGCTAAGGCGATAGGCAGCCCCGGCGGGCCGCCGCCCCCTCCCCGCCGCCGATGCCAGGTGGGGGTCCTCCTCCCCCTCCGCCTCCACCCATGCCGGGCGCTGGGGCCCCTCCCCCTCCCGATACCCGGAGGCCCGCCGCCGCCGCCCATGCCGGGGGGACCCAGGCCTCCACCGCCGCCCGGGATGCCTTCCGCCCCAAGGATGCCTCAACCGGATGTACTCCCTCACGGTCTGAAGCCCAAAAAGAAGTGGGAGGTCGAGGGACCCCTGAAGAGAGCGAATTGGAAAACCATAGTCCCCCAGAAGATGTCCGAGAAAGCTTTCTGGTTAAAGCCTCAACCGGATGTACTCCCTCACGGTCTGAAGCCCAAAAAGAAGTGGGAGGTCGAAGGACCCCTGAAGAGAGCGAATTGGAAAACCATAGTCCCCCAGAAGATGTCCGAGAAAGCTTTCTGGTTAAAGGTCCAAGAAGATAAGTTGGCTTCACCGGATATACTGACGGGATTAGCGCAGAAGTTCTCCAGCAAACCGATGGCTAAGAAGAACGAGGATAACGTCGACAGGGCCCACACCCTCAAAAAGGCGAAGGACCTCAAAGTGCTGGACAGTAAAGCGGCACAGAACCTGTCGATACTTCTGGGGGGCTCCCTGAAACACCTGTCGTACGAACACATCAAGACCTGCATACTGAGATGCGACACCACAGTACTTAATGCCAACGTACTGGATCTCCTGATACAGTACCTGCCGCCGGCGGACCAGCTCCGCAAGCTGTCCGATCTGCGGTGCTCCAGCGACGAGCTGACGGAGGCGGAGCAGTTCGCGGCCGTGGTCTCCGACGTGAAGAGGCTCGCCCCCCGGCTCAGGAGTTTGGCCTTCAGGGAGCACTACCACGAGATCGTCTCGGAGTTGAAGCCGGACATAGTGTCGGGTACAGCCGCGTGCGAGGAGGTCCGCTCCAGCGTGAAGTTCGCTCGCATCCTGGAACTGCTGCTACTCCTGGGCAACTACATGAACACGGGCTCCAACAACGCCGGCGCCTACGGCTTCGAGATCAGCTTCATCACTAAGGGTGCTATAGCGACACACACCGCCCAGCCAGCGCCACCTGCCGCGCGTCGCGTGACTGCCCACAGCTGTTTCCACATACACTATTTACCTCCTTTTAAACTGGCGGCGCGCGTGATCCAATTGAGTGTCTTATATATAACGTCATCGATATCGTATTTCGCCAGGGCCGCGAGGGTCTCGCCGGAGAATCTACAGAAGGCGCTGAAAAAGATGGAGAACGACATCCGCTCGCTAGAGACGGACCTCAACAACTCCAGGGTTCCGCACCAGAACTTCGCGAAGGAAGCCCGCGAGCAGTGCGACCTGCTGCACTCCATGTTCAAGAAGATGGAGTCGCTGTACGCCGAGCTGGCGGAGTACTACGTGTTCGACCCGGCCAAGTACACCCTGGAGGAGTTCTTCGCCGATGTCAAGACCTTCAAGGATTCCTTCGCGACGGCCCACCAGGAGAACGTTATAGCGCGAGAGACCGAGGAGAGAGCGAGGAGGGCCAGGGACGCGCGGGCGGCGGCGGAGAGGGACAGGAGGGACCGGCAGATGAGATACAAACAGTTCGTGGACATGGAGAGGGCGCAGGACGGGGTCATGGACAGCCTGATGGAGGCGCTGCAGAGCGGCTCGGCCTTCAGTCGCGAGAGACCGAGGAAGAAAGCCAATCCCAGAGTCGCCGGAGAGGATAGCGACGAGGAGCGCGAGCTCGTGAGGGCGATATTGAGTCGTATCGAAGCTGAGAGAAGAGCACAGCTGAACAGGTCGCGCTCCAGGTCCGGCCTGAGCGGGCCGCTGACGTCTCGCGAGCTGACCAATGAGTTGCTCGGGAACGCGTGA

Protein sequence:

>DPOGS203185-PA
MYNSMPRLKRRETWKRRSGLDTWFGRPKKTGGGIDRGYDTVPRAELRPDDEEPATEYTAKIDALDDDQLERRFEEMLTDMNIGEKKKEPLRKYSRDQKKKMLVAYKFVNAQEGRSKFEKPADYVSYLNQPELSVGKLHSCLENLRISLTNNPLSWIEEFGPKGIESLLTTLNVCYTNDSRYDRVQYECIRCLSAILNNTVGIRAVFDCREALPVLARSLDARKPHCALEAAKVLAAICLIPNGHEKVLEAITMAGESSRRPRLLPIIEGLSPKAPESLKNGCMQLMNAIITEPEELEFRMHLRSEFMRTGLYDLMDSLQAGAEDGGRLIQMNVFDTHAAADQEEFLAKFDDVRVDFHDVNECFELVKNLVVETPAEPYLLSILQHLLFIRDDELIRPAYYKLIEECVTQIVLHKNGYDPDFRLTQRFNIDVQPLIEGLIEKSRAEEERKVEELKSKLEAAIAARQEAEARVAHLEQRLKTAPPSGPGGVTQGNIAAIAKAIGSPGGPPPPPRRRCQVGVLLPLRLHPCRALGPLPLPIPGGPPPPPMPGGPRPPPPPGMPSAPRMPQPDVLPHGLKPKKKWEVEGPLKRANWKTIVPQKMSEKAFWLKPQPDVLPHGLKPKKKWEVEGPLKRANWKTIVPQKMSEKAFWLKVQEDKLASPDILTGLAQKFSSKPMAKKNEDNVDRAHTLKKAKDLKVLDSKAAQNLSILLGGSLKHLSYEHIKTCILRCDTTVLNANVLDLLIQYLPPADQLRKLSDLRCSSDELTEAEQFAAVVSDVKRLAPRLRSLAFREHYHEIVSELKPDIVSGTAACEEVRSSVKFARILELLLLLGNYMNTGSNNAGAYGFEISFITKGAIATHTAQPAPPAARRVTAHSCFHIHYLPPFKLAARVIQLSVLYITSSISYFARAARVSPENLQKALKKMENDIRSLETDLNNSRVPHQNFAKEAREQCDLLHSMFKKMESLYAELAEYYVFDPAKYTLEEFFADVKTFKDSFATAHQENVIARETEERARRARDARAAAERDRRDRQMRYKQFVDMERAQDGVMDSLMEALQSGSAFSRERPRKKANPRVAGEDSDEERELVRAILSRIEAERRAQLNRSRSRSGLSGPLTSRELTNELLGNA-