Monarch geneset OGS2.0

DPOGS209369
Transcript        DPOGS209369-TA  1509 bp
Protein           DPOGS209369-PA  502 aa
Genomic position  DPSCF300118 - 177119-185832
RNAseq coverage   2806x (Rank: top 4%)
Annotation
Heliconius      HMEL013364        6e-93  60.54%
Bombyx          BGIBMGA005701-TA  0.0    90.26%
Drosophila      sn-PF             0.0    69.32%
EBI UniRef50    UniRef50_Q24524   0.0    69.32%  Protein singed n=38 Tax=cellular organisms RepID=SING_DROME
NCBI RefSeq     XP_972494.1       0.0    79.21%  PREDICTED: similar to fascin [Tribolium castaneum]
NCBI nr blastp  gi|91089337       0.0    79.21%  PREDICTED: similar to fascin [Tribolium castaneum]
NCBI nr blastx  gi|91089337       0.0    79.21%  PREDICTED: similar to fascin [Tribolium castaneum]
Group
Gene Ontology    GO:0051015  1.3e-25  actin filament binding
                 GO:0030674  1.3e-25  protein binding, bridging
KEGG pathway
InterPro domain  [13-147]  IPR008999  2e-36    Actin cross-linking
                 [27-141]  IPR022768  1.3e-25  Fascin domain
Orthology group  MCL13567  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209369-TA
ATGAACGGAGCCGGAGACATCATCACACAAAACCAGCAGCGAGGCTGGTGGACCATAGGACTGATCAACTCGCGGTACAAGTACCTCACCGCGGAAACCTTCGGCTTCAAGATCAACGCTAACGGCACCAGCCTCAAGAAGAAACAGATCTGGACCCTGGAACCCGCTCCCGGCAAAGCCGATGACAGTATGATATATTTGAGGTCGCACCTGGACAAGTACCTCGCCGTGGACTCGTTCGGCAACGTGACCTGCGAGTCTGATGAGAAGGAGACCGGGAGCAAGTTCCAGATCAGTGTGTCGGAGGACGGGTCGGGGCGCTGGGCTCTGAAGAACGTGGAGCGAGGATACTTCCTGGGCTCGAGCTCGGACAAGCTGACCTGCACCGCCAAGGTCCCCGGAGACGCCGAGCTGTGGCACGTGCATCTGGCGGCCCGGCCTCAGATGAACCTGCGTTCCATCGGCCGTAAGCGGTTCGCTCACCTGTCCGAGTCCCTGGACGAGATCCACGTGGACGCCAACGTGCCGTGGGGCGAGGACACGCTGTTCACGCTGGAGTTCCGCGCGGACGAGGGCGGCAAGTACGCCCTCCACACCTGCAACAACAAGTACCTCAGCGCGCCCGGGAAGTTGCTGGACACGTGCACCCCGGAGTGCCTGTTCAGCGCGGAGTACCACGCGGGCGCGCTGGCGCTCCGGGACGCGGCGGGGGCCTACCTCGCGCCCATCGGCTCCAAGGCGGTGCTGAAGACTCGCTCCACGGCCGCCACGCGGGACGAGCTCTTCTCCCTGGAGGACAGCCTGCCGCAGGCCGCCTTCGTGGCCGCCCTCAACGACAAGCACGTGTCCGTCAAGCAAGGTGTGGACGTGACGGCCAACCAGGACGAGATCTCTTCCCATGAGACCTTCCAGTTGGAGTTCGACTGGGGCACTAAGCGCTGGTACATCCGCACCATGCAGGACCGGTACTGGACCCTGGAGACGGGCGGCGGCATACAGGCCAGCGGCGACAACAAGTCGTCCAACGCGCTGTTCGAGCTGTCGTGGCAGGGCGACGGCGCGGTGTCGTTCCGCGCCAACAACGGGAAGTACGTCCTCACCAAGCGCTCCGGACACCTGTACGCTAGCGCCGACACCGTCGACGACAACTGCAAGTACTACTTCTATCTCATCAACAGACCGATCCTGGTGCTGAAGTGCGAGCAAGGGTTCGTGGGCCCCAAGGGCGTCCGCCTGGAGTGCAACAAGGCCAACTACGAGACCATACAGGTCATCCGCGGACCCAAGGGAGCCGTCTACTTCAAGGGTCAGAACGGCAAGTACTGGCACGCGGACAGCGAGGCGGTGAGCTGCGACAGCGACTCGCCGCAGACCTTCTACCTGGAGCTGCGCGAGCCGACCCGCCTGGCCATCCGGTCGGGGTCGGGCCAGTACCTCGCCGCCGCCAAGAACGGCAACTTCCGCCTGGCCGGGCCCGAGCTGGCGCACGCCACGCACTGGGAGTACTAG

Protein sequence:

>DPOGS209369-PA
MNGAGDIITQNQQRGWWTIGLINSRYKYLTAETFGFKINANGTSLKKKQIWTLEPAPGKADDSMIYLRSHLDKYLAVDSFGNVTCESDEKETGSKFQISVSEDGSGRWALKNVERGYFLGSSSDKLTCTAKVPGDAELWHVHLAARPQMNLRSIGRKRFAHLSESLDEIHVDANVPWGEDTLFTLEFRADEGGKYALHTCNNKYLSAPGKLLDTCTPECLFSAEYHAGALALRDAAGAYLAPIGSKAVLKTRSTAATRDELFSLEDSLPQAAFVAALNDKHVSVKQGVDVTANQDEISSHETFQLEFDWGTKRWYIRTMQDRYWTLETGGGIQASGDNKSSNALFELSWQGDGAVSFRANNGKYVLTKRSGHLYASADTVDDNCKYYFYLINRPILVLKCEQGFVGPKGVRLECNKANYETIQVIRGPKGAVYFKGQNGKYWHADSEAVSCDSDSPQTFYLELREPTRLAIRSGSGQYLAAAKNGNFRLAGPELAHATHWEY-
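
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The `translate` helper and its `stop_symbol` parameter are hypothetical names introduced for illustration and are not part of the geneset. Only the first 18 and last 15 nucleotides of DPOGS209369-TA are used here.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if r == "*" else r for r in residues)


# First 18 nt and last 15 nt of DPOGS209369-TA, copied from the record above.
print(translate("ATGAACGGAGCCGGAGAC"))  # MNGAGD
print(translate("CACTGGGAGTACTAG"))     # HWEY-

# Length bookkeeping from the stated sizes: 1509 bp = (502 residues + 1 stop codon) * 3
assert 1509 == (502 + 1) * 3
```

The two translated fragments match the start (MNGAGD) and end (HWEY-) of DPOGS209369-PA, and the stated lengths of 1509 bp and 502 aa are consistent with a single terminal stop codon.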