Monarch geneset OGS2.0

DPOGS200827
TranscriptDPOGS200827-TA1566 bp
ProteinDPOGS200827-PA491 aa
Genomic positionDPSCF300071 - 552698-559491
RNAseq coverage487x (Rank: top 26%)
Annotation
HeliconiusHMEL0114710.093.26% 
BombyxBGIBMGA009881-TA0.091.51% 
DrosophilaCG32138-PA1e-17060.31% 
EBI UniRef50UniRef50_D6X0200.070.90%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X020_TRICA
NCBI RefSeqXP_001815658.10.070.68%PREDICTED: similar to AGAP004805-PA, partial [Tribolium castaneum]
NCBI nr blastpgi|3454852420.072.79%PREDICTED: formin-like protein CG32138-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|3454852429e-17872.79%PREDICTED: formin-like protein CG32138-like isoform 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00054885e-85binding
GO:00037792.8e-43actin binding
GO:00160432.8e-43cellular component organization
GO:00170484.9e-15Rho GTPase binding
GO:00300364.9e-15actin cytoskeleton organization
KEGG pathwaydan:Dana_GF221761e-20 
 K04512 (DAAM)maps-> Wnt signaling pathway
InterPro domain[1-419] IPR0160245e-85Armadillo-type fold
[245-440] IPR0104722.8e-43Diaphanous FH3
[184-242] IPR0104734.9e-15Diaphanous GTPase-binding
Orthology groupMCL10500 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200827-TA
ATGGACCTGCCGCCAGACAAGGCAAAGCTGCTGCGGAACTATGACTTGGAAAAGAAATGGGAGATCATATGCGACCAAGACATGGTGCAGGCGAAGGACTCGCCCGCCCACTATCTCAACAAACTGAGGACCTACCTTGACCCTAAGGCGTCCAGGAGTCACAGAAAGAGAAAGATGGTCGGTGACTCCACGTCGACGCAGGTTCTTAGGGATTTAGAAATATCACTGCGAACTAATCACATCGAATGGGTCCGTGAGTTCCTGAACGATCAGAATCAAGGTCTGGATGTGTTGATCGACTACCTCAGCTTCAGACTGAGCATGATGAGGCACGAACAGCGAATAGCACTCGCGAGGAGCCACTCCACAGACGCCATCAACCAAGCGAACACGACGACTTCAGAGTGCAGCGGGCCGGAGATGGGTGCGGGCTCCACGTGGCGGCGGAGGGCGAGGTCCGCGGACTCGGAGGGCGAAGGTCCGGGGGCGGGGTCCCCGGCCGCGGCCAGACGGAGGACCAGGCATGCGGCGAGGCTCAACATGGGCGCCTCCACTGATGATATACACGTCTGCATCATGTGCATGAGGGCCATCATGAACAATAAGTATGGCTTCAACATGGTGATCCAACATCGCGAGGCTATCAACAGCATAGCCTTGTCCCTGGTACATCACTCGCTCAGAACGAAGGCACTGGTGTTGGAACTGTTGGCGGCCATCTGCCTGGTGAAGGGCGGTCATCAGATCATTCTCTCCGCCTTCGATAACTTCAAGGAGGTGGTCGGTGAGCCGAGGAGGTTCCACACTCTCATGGAGTACTTCATGAACTATGACAGCTTCCATATTGAGTTCATGGTGGCGTGTATGCAGTTCGTCAACATAATAGTACATTCAGTAGAGGACATGAACTTCCGGGTGCACCTCCAGTACGAGTTCACGGCGCTCAAACTAGACGACTACCTCGAGAGGCTGAGGCTCTGTGAGAGCGAAGACTTACAGGTCCAAATATCAGCGTACCTCGACAACGTGTTCGACGTGGCGGCTCTCATGGAGGACAGCGAGACGAAGACGGCGGCCTTGGAGAAGGTCAATGAACTGGAGGATGAATTGGGACATGCTCACGAGCGACTGGCGTCCTTGGAGAGAGAAGCCATCGCTAAACAAGCAACGCTGGAGGCGGAACTAGCGCAAGTTAGACACGAGAGAGACCAGCTCGCTGAAGCACGGAGGCAGGTCGTGGAGGAGGTGTCGACTCTGAGGAGAGCTCAGCAGGACTCGAGGAACAGGCAGTCGATGTTGGAGTCGAAGGTGCAGGAACTGGAATCGCTGACCAAGTCACTACCACGAGGAGCCTCCACTGACGAAACATCAACAAAAACACAAATAATAATGATATATAATTCCTCGCATCCTCCTCATGCGGATGATGAGGGATTGGAACAGCTGCCTGCGTCTGTTTCCTTCGACTTATACTGAGATAGTACGCTCACCACCTACGACTTCGTTTCGATGTTCCAATTGGTAAAATGACAGTGAAACACTGTCGCGTGTGTGTCGGTTGCTTAA

Protein sequence:

>DPOGS200827-PA
MDLPPDKAKLLRNYDLEKKWEIICDQDMVQAKDSPAHYLNKLRTYLDPKASRSHRKRKMVGDSTSTQVLRDLEISLRTNHIEWVREFLNDQNQGLDVLIDYLSFRLSMMRHEQRIALARSHSTDAINQANTTTSECSGPEMGAGSTWRRRARSADSEGEGPGAGSPAAARRRTRHAARLNMGASTDDIHVCIMCMRAIMNNKYGFNMVIQHREAINSIALSLVHHSLRTKALVLELLAAICLVKGGHQIILSAFDNFKEVVGEPRRFHTLMEYFMNYDSFHIEFMVACMQFVNIIVHSVEDMNFRVHLQYEFTALKLDDYLERLRLCESEDLQVQISAYLDNVFDVAALMEDSETKTAALEKVNELEDELGHAHERLASLEREAIAKQATLEAELAQVRHERDQLAEARRQVVEEVSTLRRAQQDSRNRQSMLESKVQELESLTKSLPRGASTDETSTKTQIIMIYNSSHPPHADDEGLEQLPASVSFDLY-