Monarch geneset OGS2.0

DPOGS207837
Transcript         DPOGS207837-TA   3705 bp
Protein            DPOGS207837-PA   1234 aa
Genomic position   DPSCF300042 + 1052486-1063957
RNAseq coverage    406x (Rank: top 30%)
Annotation
Heliconius       HMEL015303         0.0   87.99%
Bombyx           BGIBMGA005519-TA   0.0   85.76%
Drosophila       fliI-PA            0.0   64.35%
EBI UniRef50     UniRef50_G6DL20    0.0   97.70%   Putative uncharacterized protein n=4 Tax=Ditrysia RepID=G6DL20_DANPL
NCBI RefSeq      XP_001842948.1     0.0   68.20%   flightless-1 [Culex quinquefasciatus]
NCBI nr blastp   gi|270013772       0.0   69.85%   hypothetical protein TcasGA2_TC012416 [Tribolium castaneum]
NCBI nr blastx   gi|270013772       0.0   70.03%   hypothetical protein TcasGA2_TC012416 [Tribolium castaneum]
Group
Gene Ontology     GO:0003779   3.8e-22   actin binding
KEGG pathway      gga:395774   3e-74
                  K05768 (GSN) maps-> Regulation of actin cytoskeleton
                                      Fc gamma R-mediated phagocytosis
InterPro domain   [90-1230]    IPR007122   0         Gelsolin
                  [505-587]    IPR007123   2.6e-13   Gelsolin domain
Orthology group   MCL13339     Single-copy universal gene

Nucleotide sequence:

>DPOGS207837-TA
ATGGCAAATACAGGCTTGTTACCATTTGTGCGTGGTGTGGATTTTACATGCAATGACTTCAGTGGTGACAAATTCCCCGATGCTATCCGATATATGACCGGTTTGCAATGGCTCAGACTCGATAAAACAAACTTAGAAGAGATTCCCGAAGAATTGGGAAAACTTATGAAATTGGAAAATTTGTCACTCAAAAAGAATAATCTAGAAAAGTTATTTGGAGAATTGACAGAGTTAAAATGTCTAAGATCTTTGAATGTTAGGCATAATAATGTGAAGACGTCAGGAATACCTGCTGAACTATTCAGATTAGATGATCTAACTACTCTAGATCTTTCTCATAATAGGCTAAAGGAAGTTCCTGAAGGGCTAGAAAAAGCTAAATCATTGCTTGTATTAAATCTGAGTCACAACAAAATAGAAAGCATACCACCTACTTTATTTGTTCAGTTGACGGATCTACTGTTTTTAGACCTATCTAGTAATTTACTGGAGACTCTGCCCCCTCAGACTAGAAGGTTGGCAAACCTGCAGACTTTAATACTAAACGATAACCCATTAGGATTATTTCAATTGAGGCAGTTACCGTCTCTACAAAGCCTCGAGACATTACACATGAGGAACACACAGAGAACACTAGCTAATCTACCAACATCACTCGAACCCCTCATAAATTTATCTGATGTTGACTTGTCCAAAAACGCCCTGACAAAAGTGCCTGACGCACTTTACACGCTACAGAACATAAAAAGGCTTAATCTAAGTGAAAATGAGATAACAGAAATATCTACTGCTATGGACATCTGGCAGAAACTGGAAAGTTTAAATTTATCCCGGAACAAACTCACAACCTTACCGGCCACTCTGTGCAAACTGCAGAGCTTAAGGCGGCTTCATGTAGATGACAACAAGCTGGATTTCGAAGGAATTCCTTCTGGAATAGGAAAACTTGGCAATTTAGAAGTATTCTCGGCTGCTAATAATCTACTTGAAATGATCCCTGAGGGACTTTGCAGATGCGGTTCTTTAAAGAAATTAAACCTGAGCTCAAACAAATTAATAACATTGCCGGATGCTATACATCTACTGAGTGACCTTGAGAGTTTGCAATTACATGGCAACCCGGACCTGGTGATGCCGCCAAAGCCTGTAGAGAGAGCGAGGGGAGCTGGTCTCCAGTACTATAATATAGACTTCTCTCTACAAACACAGTTGCAATTGGCTGGTGCGGCAACTCCTGAACTGGCAGACAACAATAGTATCAGTAAGTCGGACCCCATCGCTCGCAAGCTCCGTCTCCGTCGTCGAGGTGAGACTGACAACTCTGATGTTATCCTTCGAGGAATGCAGGAACAGGCCAACACCCAGCACGCAACCAGCAACGATGAAATGCCACACAGCGAACTGAAACCTAAACGCTGGGACGAAAGTTTGGAAAAACCACCGTTAGATTACTCGGAGTTCTTCGACGAGGATACCGGTCAAGCTCCGGGTCTCCAGATATGGGAGATTGAGAACTTCATCCCCGCACCCGTTGACGAAGTCGCACACGGTAAGTTCTTCGAGGGTGACTGCTACATAGTACTGAAGACGTCCATCGAAGAACAAGGACAGCTGTCTTGGGATATTCACTTCTGGATTGGATCCAAAGCGACGTTGGACAAGGGCGCGTGTGCAGCAATGCACGCTGTGAACCTAAGAAACTTGTTGGGAGCCAAACGCACCCAGAGACACGAACAAGGGGACGAGTCACCGGAGTTCCTGGCGCTGTTCCCGACACCCCCTGTATATATCAACGGCAGTAGAACTCCGTCAGGCTTCTTCACCGTCGATGATCCGCACTACGTGACCCGTCTGTACCGCGTCCACGGAGCCGGCAGCTCGATACACCTGGAGCCATCCCCCGTGTCAGCGAGCTCGCTCGACCCGAGATACGTCTTCGTACTTGACACGGGATTGCGCATACACTTATGGAATGGGAAAAAGGCAAAGAATACGCTAAA
GTCCAAGGCGCGGCTGTTCGCTGAGAAGATAAATAAAGAAGAGAGGAAGAACAAAGCCGAATTAATAGCAGAAGTACCAGGAAAAGAGTCTAAGAACTTCTGGCAAGTCTTAGGATATGAAGACGATATGCCTTACGTTGCAGAAGAACACGTCCCTGACAACTTCACGTGGTCCCCAGCTAGACTGTACCGCGTGGAGCTCGGCATGGGATATCTGGAACTGCCTCAGACCGAAGGACCTCTTACAAGGACGATACTGGCCACGAGGAATGTGTACATACTAGACGCGCACCAGGATCTGTTTGTATGGTTCGGCAAGAAATCATCACGTTTAGTCCGAGCGGCGGCTGTGAAGCTCGCCCAGGAATTGTTCAGTATGGCGCCCAGAGAACCTCACGCACTGGTCACCAGGTTACAGGAGGGCACAGAAACACAGGTGTTCAAGACGTACTTCCAAGGCTGGGAGGAGGTTATAGCGGTGGACTTCACGCGAACAGCCGAGTCTGTGGCGAGGACCGGGGCCGACCTCACGTCCTGGGCCAGGCAGCAAGAGACCAAAACGGATCTATCGGCACTGTTCACACCTCGTCAGCCGGCCATGTCTCCGACCGAGGCTAAGAGTCTCGCTGACGAGTGGAACGAAGACCTGGAAGCGATGGAGGCCTTTGTTCTGGAAGGCCGGCATTTCGTCCGTCTTCCAGATCAAGAGTTGGGGGTCTTTTACAGCTGTGATTGCTACGTATTCCTCTGTAGATACGTACTGCCTGTAGAGGCTGATGATGACACGCCCGAAGCGGACGAAGTGGACTCGGAGAGCGACAGTGTGACTTGGGTGGTGTACTTCTGGCAAGGCAGGAGAGCTCCCAACATGGGCTGGCTGACATTCACCTTCGGCTTGGAGAGGAAATTCAAACAGCTCTGCAAGAGACTAGACGTGGTCAGAACACATCAGCAACAGGAGAGCCTCAAGTTCATGGCACATTTCAGAAGAAGATTCATAATACGAGATGGAAAACGAAATTTAAAACCGGAGGGCCGGCCGCCAGTGGAGTTGTTCGAGCTCCGGAGTAATGGCTCGTCGTTATGTACGCGACTTGTGCAAGTCAAACCAGATGCCAGCGTGCTGAACAGTGCTTTCTGTTATATCCTGAACGTTCCTTTGGAGGGCTCTAAGGAAGAGTCGTCAGCGATCGTGTACGCCTGGATCGGCTCCAAGAGCGACGCTGACTCCGCCCGTCTCATAGAACTCATCGCCAATGAAAAATTTAACAACGATTTCGTCAGTCTACAGGTGTTAACAGAGGGCAGTGAACCTGACAATTTCTTTTGGGTAGCGCTGGGCGGGCGAAAGCCTTACGACGAAGACGCCGAATACTTAAACTACACGCGACTCTTCAGATGTTCCAACGAAAAGGGATACTTCACTGTATCTGAGAAGTGGACGGACTTCTGTCAAGATGACCTCGCTGATGATGATATTATGATACTGGACAATGGTGAGCAAGTGTTCCTGTGGCTCGGTGCGAGATGTTCAGAGGTGGAAATCAAACTGGCTTATAAATCAGCTCAGGTATACATTCAGCATATGAAAACGACCCAACCGGACAGACCTCGAAAACTGTTCCTCACATTGAAGGACAAGGAGTCTCGGAGATTTACCAAGTGCTTCCACGGCTGGGGGGAACACAAGAAACCTCCGGAATAA

Protein sequence:

>DPOGS207837-PA
MANTGLLPFVRGVDFTCNDFSGDKFPDAIRYMTGLQWLRLDKTNLEEIPEELGKLMKLENLSLKKNNLEKLFGELTELKCLRSLNVRHNNVKTSGIPAELFRLDDLTTLDLSHNRLKEVPEGLEKAKSLLVLNLSHNKIESIPPTLFVQLTDLLFLDLSSNLLETLPPQTRRLANLQTLILNDNPLGLFQLRQLPSLQSLETLHMRNTQRTLANLPTSLEPLINLSDVDLSKNALTKVPDALYTLQNIKRLNLSENEITEISTAMDIWQKLESLNLSRNKLTTLPATLCKLQSLRRLHVDDNKLDFEGIPSGIGKLGNLEVFSAANNLLEMIPEGLCRCGSLKKLNLSSNKLITLPDAIHLLSDLESLQLHGNPDLVMPPKPVERARGAGLQYYNIDFSLQTQLQLAGAATPELADNNSISKSDPIARKLRLRRRGETDNSDVILRGMQEQANTQHATSNDEMPHSELKPKRWDESLEKPPLDYSEFFDEDTGQAPGLQIWEIENFIPAPVDEVAHGKFFEGDCYIVLKTSIEEQGQLSWDIHFWIGSKATLDKGACAAMHAVNLRNLLGAKRTQRHEQGDESPEFLALFPTPPVYINGSRTPSGFFTVDDPHYVTRLYRVHGAGSSIHLEPSPVSASSLDPRYVFVLDTGLRIHLWNGKKAKNTLKSKARLFAEKINKEERKNKAELIAEVPGKESKNFWQVLGYEDDMPYVAEEHVPDNFTWSPARLYRVELGMGYLELPQTEGPLTRTILATRNVYILDAHQDLFVWFGKKSSRLVRAAAVKLAQELFSMAPREPHALVTRLQEGTETQVFKTYFQGWEEVIAVDFTRTAESVARTGADLTSWARQQETKTDLSALFTPRQPAMSPTEAKSLADEWNEDLEAMEAFVLEGRHFVRLPDQELGVFYSCDCYVFLCRYVLPVEADDDTPEADEVDSESDSVTWVVYFWQGRRAPNMGWLTFTFGLERKFKQLCKRLDVVRTHQQQESLKFMAHFRRRFIIRDGKRNLKPEGRPPVELFELRSNGSSLCTRLVQVKPDASVLNSAFCYILNVPLEGSKEESSAIVYAWIGSKSDADSARLIELIANEKFNNDFVSLQVLTEGSEPDNFFWVALGGRKPYDEDAEYLNYTRLFRCSNEKGYFTVSEKWTDFCQDDLADDDIMILDNGEQVFLWLGARCSEVEIKLAYKSAQVYIQHMKTTQPDRPRKLFLTLKDKESRRFTKCFHGWGEHKKPPE-