Monarch geneset OGS2.0

DPOGS206586
Transcript: DPOGS206586-TA (3099 bp)
Protein: DPOGS206586-PA (1032 aa)
Genomic position: DPSCF300108 + 683312-704285
RNAseq coverage: 290x (Rank: top 38%)
Annotation
Heliconius: HMEL004372  2e-88  52.77%
Bombyx: BGIBMGA013748-TA  7e-63  79.58%
Drosophila: mim-PF  4e-47  43.26%
EBI UniRef50: UniRef50_D6WQ04  4e-88  42.15%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WQ04_TRICA
NCBI RefSeq: XP_973999.2  6e-88  42.08%  PREDICTED: similar to CDNA sequence BC060632 [Tribolium castaneum]
NCBI nr blastp: gi|270009701  2e-87  42.15%  hypothetical protein TcasGA2_TC008993 [Tribolium castaneum]
NCBI nr blastx: gi|270009701  1e-128  35.96%  hypothetical protein TcasGA2_TC008993 [Tribolium castaneum]
Group:
Gene Ontology:
  GO:0046847  2.1e-43  filopodium assembly
  GO:0007165  2.1e-43  signal transduction
  GO:0017124  2.1e-43  SH3 domain binding
  GO:0008093  2.1e-43  cytoskeletal adaptor activity
KEGG pathway:
InterPro domain: [9-191]  IPR013606  2.1e-43  IRSp53/MIM homology domain (IMD)
Orthology group: MCL15631  Insect specific

Nucleotide sequence:

>DPOGS206586-TA
ATGTATGTTTGTTTATTTGTTACAGGAGCCACAAAGGAAATAGGAACCGCCTTGACAAGGGTCTGCCTCAGACACAGGGCTATAGAGACTAGAATGAAGACGTTTATCAGCACGTTGATGGAGCGTCTTATAGCTCCACTATCGGAACGAGCTGATGAATGGCGTCGCGGTTGTAATGTTACCGGCGCATTGAGCCGAGAACACGCACGTGAGTGTAAACGCGCCAGGGCTGAGCTGCGCAGGCGGGTCCATGATGCTCAGAGACACGCTAGGAAAGCGAGGCGGACGAATCCGGATGTTAAGCGGAGAGCTGACGTTTGTTTGCAGGATATCCAAGAACGAAAGCAGCAATTGGAGGAGATGGAAGAGAAGGCGGTGAAGGCCGCTCTGATAGAAGAACGGAGCCGGTTCTGTAACTTCGTGTCTCTTCTCAGTCCTGTTGTGGAGAGTGAAGTGGCAATGCTAGCAGAGGTCAGTCACCTTCAAGAGGGAACAGAACAGTTGTCGAGGCAGATTTCGGAACCTCGGAGTTTACCCCCTGGGAGTCTACAGGTGATTTGCGATATAAAGTCTTGCTACAGTGGTTGGGCGGAAGGTGGCTCGGTTCCGCCCTCGCCCTCTACATCACGACTTGGCAGTCGCAAGTCATCTTTGACATCTATATCGTCTCTTAGCAGCCAATGCTCCGATCAACATGGTGTGTCAGGCTCAACAGTGAGCTGTTCAACTCCTATATCAACTGGATCGTCTATAGCACCAGCTTTGGACTCCCAGCGTGGACAGTCCACGTCTCGTCTGGCGAGCGTCTGTAGCAGTGACTCTGGTTTTCGTTCCCAAGACACTCTCCAGCGACCCTCTCTATACGTTGACAATGGTTCAGACAATCAAAGCATTAACAGCGAATGCGCGACACCGTCTAAGACAAATGACGAGAATGACTCCAACGATCTCACGTCCAACTCGGCTTCAGCCACGTGGCCAGATCTCAAAGACACGGCGCAGTTCGAGAGAGCGGCCTCCGCTATAATGGGGGGCAGACCGCATACTATATCAGCTGGTACGGATCGCTCGTTCCAACGGCCGGCGTTGAGTGTTCATACATTCAGCGAGAACGCCCGCGAGGGTATATACGCCAGGCCGCCTCTACCCACTGGTTCAGACAATCAAAGCATTAACAGCGAATGCGCGACACCGTCTAAGACAAATGACGAGAATGACTCCAACGATCTCACGTCCAACTCGGGTACGGATCGCTCGTTCCAACGGCCGGCGTTGAGTGTTCATACATTCAGCGAGAACGCCCGCGAGGGTATATACGCCAGGCCGCCTCTACCCACTCGCTGTTCATCGTTGGAGCGTCCCAGTGTGCCGGCGAGGAATTCCAATTCAACAAGAGCTGAGTTCAAACCAAATAAACCTACGAGCCTTCCGCCGCACCTCGCTAAAGAAGTACCACAAGCTTTGTACGTGAATATGTCAGAGTTGGCGACTATGGCGGCGTCACGAGCACAGCAACAGAACAGTACGGAGTATCCTCAACAGGAGAAGCGCTGTTCATCGTTGGAGCGTCCCAGTGTGCCGGCGAGGAATTCCAATTCAACAAGAGCTGAGTTCAAACCAAATAAACCTACGAGCCTTCCGCCGCACCTCGCTAAAGAAGTACCACAAGCTTTGTACGTGAATATGTCAGAGTTGGCGACTATGGCGGCGTCACGAGCACAGCAACAGAACAGTACGGAGTATCCTCAACAGGAGAAGGTATGTTCAGAGAGCAGTGCCAGCGAATCGTCCCTAGAATCATCGAGCGGCTATGGAAGTCAGGGCGCGTTCGCCGGCGAAGATCACTCTCATCTCATACACCCAGACGTGTCGTCGGAAATAGTGACACTACGTCACAGCGATTCCGTTGTGATCGCACGCGAGAATTTCTCGATATCACTTGGAAGTCTCGAGGAGGCCGTGAGATGTTTGGACGAGGCGTCTGAATCACCGACCTTCGC
TACTATAGGCAAAAAACCAACCGTCCCAAAAAGAAGACCTGTATCAATGACAGGTGATATGCTTTGTGCTTATATACTGATGTGTTTGTACAACACAGTGTTGAGTGTCGTGAACTCGTCTGATGTTGACAACCTGCCTCCGCCGCCGGCCTTCCTCCTGCAGCCGGACGATGGGAACGCTCATCCAGGTATCAACGTAGCTGAGACGGTCAAACAGTTGACGGAATTGAAGCACATGCCGGCCTCGCCCGGTCTAGTGCGAAGGACGGTCCAAAACCAGAACGAGAACCAATCACCGATATACGGAACCCAGAAACAGTTGTACGGGACCCACAACCAGAATACGCCGATATATGGAACCCAGAATCAAAATCTCAGCTCGTTCCAGCAAGCTAAAAGCAATTTCTCATCCAGCATGAGCTTGAACAGCGGCAACCTGAACCCTATATACTCGCAGACAGGAAACAGAATGTTCGAGAACTCGCTGTTCGTTAGGAAGAACAGTTTGAACAGTTCTAACACAGATATATACGGCGGTAATACGGGTATATATGAAACTAAATCACCCCACGGCTCCACTCCATCAACACCGAGCTATTCTGAAACCAATACCTTCTCCAGCTTTGGGCCGAGGGTAACAAATGAATCTCATTACGGACAGACGGGTGTGAAGCTTCAAGATCAAAGCGGTATATACGCAGCGCCGTCGCTCGTTAGGAGGCTGACCCCGCGGGCTCACTCCGCTGAGAGAAACGCTAAAGAACACGGAGGTCTTATAGCGACTCTAAGTGCTAAGTTAGCACCACAACTATCTCCGAGGAGTCCGAGACGAGCCAATCCTAATGTCACTGAATCACCCGCTAAAGGGAAAGGTGTTCCAGCTACAGTACAGCCAGCGTTCTTGGACAAACTGTCAGCTACCTTACAACATCAGAGAAGGATGCAGTCTAGAGCGAACACAGTCAGAGATCTCATTAATGCGCATGCACAGCCGGATCCGAGGGTATGTCACACATCACTCATGGAACAAATCAAAAGAGGTGCGACACTCAGAAGAAATAAATATTGCAACGACAGATCAGCGCCCAAAATACGTTAA

Protein sequence:

>DPOGS206586-PA
MYVCLFVTGATKEIGTALTRVCLRHRAIETRMKTFISTLMERLIAPLSERADEWRRGCNVTGALSREHARECKRARAELRRRVHDAQRHARKARRTNPDVKRRADVCLQDIQERKQQLEEMEEKAVKAALIEERSRFCNFVSLLSPVVESEVAMLAEVSHLQEGTEQLSRQISEPRSLPPGSLQVICDIKSCYSGWAEGGSVPPSPSTSRLGSRKSSLTSISSLSSQCSDQHGVSGSTVSCSTPISTGSSIAPALDSQRGQSTSRLASVCSSDSGFRSQDTLQRPSLYVDNGSDNQSINSECATPSKTNDENDSNDLTSNSASATWPDLKDTAQFERAASAIMGGRPHTISAGTDRSFQRPALSVHTFSENAREGIYARPPLPTGSDNQSINSECATPSKTNDENDSNDLTSNSGTDRSFQRPALSVHTFSENAREGIYARPPLPTRCSSLERPSVPARNSNSTRAEFKPNKPTSLPPHLAKEVPQALYVNMSELATMAASRAQQQNSTEYPQQEKRCSSLERPSVPARNSNSTRAEFKPNKPTSLPPHLAKEVPQALYVNMSELATMAASRAQQQNSTEYPQQEKVCSESSASESSLESSSGYGSQGAFAGEDHSHLIHPDVSSEIVTLRHSDSVVIARENFSISLGSLEEAVRCLDEASESPTFATIGKKPTVPKRRPVSMTGDMLCAYILMCLYNTVLSVVNSSDVDNLPPPPAFLLQPDDGNAHPGINVAETVKQLTELKHMPASPGLVRRTVQNQNENQSPIYGTQKQLYGTHNQNTPIYGTQNQNLSSFQQAKSNFSSSMSLNSGNLNPIYSQTGNRMFENSLFVRKNSLNSSNTDIYGGNTGIYETKSPHGSTPSTPSYSETNTFSSFGPRVTNESHYGQTGVKLQDQSGIYAAPSLVRRLTPRAHSAERNAKEHGGLIATLSAKLAPQLSPRSPRRANPNVTESPAKGKGVPATVQPAFLDKLSATLQHQRRMQSRANTVRDLINAHAQPDPRVCHTSLMEQIKRGATLRRNKYCNDRSAPKIR-