Monarch geneset OGS2.0

DPOGS201245
Transcript: DPOGS201245-TA, 1782 bp
Protein: DPOGS201245-PA, 593 aa
Genomic position: DPSCF300037 + 67437-76084
RNAseq coverage: 1421x (Rank: top 9%)
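The lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the transcript sequence is coding sequence only (it starts with ATG and ends with a stop codon) and that the genomic coordinates are 1-based and inclusive. Neither convention is stated in the record. The function names are hypothetical and chosen for illustration.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals 3 * (residues + 1 stop codon).

    Assumption: the transcript contains no untranslated regions.
    """
    return transcript_bp == 3 * (protein_aa + 1)


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record: 1782 bp transcript, 593 aa protein,
# genomic interval 67437-76084 on scaffold DPSCF300037.
print(cds_length_matches(1782, 593))  # True
print(span_length(67437, 76084))      # 8648
```

Under these assumptions the gene spans 8648 bp of the scaffold while the transcript is 1782 bp, which is consistent with a spliced gene model.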
Annotation
Heliconius: HMEL003211, 2e-129, 73.02%
Bombyx: BGIBMGA012467-TA, 0.0, 73.87%
Drosophila: CAP-PT, 3e-95, 45.98%
EBI UniRef50: UniRef50_C9DTM4, 2e-170, 74.48%, CAP isoform A n=3 Tax=Obtectomera RepID=C9DTM4_BOMMO
NCBI RefSeq: NP_001166801.1, 4e-171, 74.48%, c-Cbl-associated protein isoform A [Bombyx mori]
NCBI nr blastp: gi|290563437, 9e-170, 74.48%, c-Cbl-associated protein isoform A [Bombyx mori]
NCBI nr blastx: gi|290563437, 1e-164, 53.38%, c-Cbl-associated protein isoform A [Bombyx mori]
Group
Gene Ontology: GO:0005515, 2.1e-21, protein binding
KEGG pathway: ptr:450628, 3e-49
    K06086 (SORBS1, SH3D5, PONSIN, CAP) maps to:
        Insulin signaling pathway
        Adherens junction
        PPAR signaling pathway
InterPro domain:
    [329-384] IPR001452, 2.1e-21, Src homology-3 domain
    [400-451] IPR011511, 3.1e-13, Variant SH3
    [344-363] IPR000108, 1.6e-07, Neutrophil cytosol factor 2 p67phox
Orthology group: MCL11098, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS201245-TA
ATGAATCAAAACCGCTTAAGCGAACCAGTTTTCGGTAAAAGTTCTGTGTCGGATTATTATTATGATGAAGAAATAAATTTTAGAGATCTTCGACTATCTGGCTCTATGTCCAGATCGTTAAGGGCATTGAACAATTATGTGAAGAAAAACATTTTTTACGATGAAAGTGAGGACCCAGAAAATGAACACGATAGTGATTATTATTCCAAAGAAACAAGCCCAAACGATTCGGTCTCTTTAGTGAAAAGTGATTCAGAAAGTGTTTTGAGCGAAGTGTCAAGTGTAAAAGATAAAATAAATAACATCCCTACTCATGCAACGGAGGACTCGCATTCCAAAACATCTAACGACCCCACAACTTTAACCGATGACTATAAATTCAATAAGCTTATTGAAAATGCTTCAAAAACTGAAACACCTCTTGTGATAGAAAATTCAAATGATATAAATAATAGCAAAGAACTTAAAGAGCCTCCCGTGCCTAGTACGCGCCATTCACTATCAACTCCTCTCATAGATGAATCTGAGGCAAAGGAAAAAGATGTTGACCTTGAAAAAGTTACGCTAAGGACAAAAAATATATCAAGAGATAATCGACACACCGTGCACGATGTATCGGAATGGGTAAATAGAACTGATATTTACCCGGATGTTTACGCGCCCCTGCCGTATAAATCTCCTAACCGCCGCTACATTGAGAGCGACGTGAACATCCATTACCGCTGTCCGGTGAGACACGACCCTCTGCCGCTAGTCCCAGAACGCGAGCTGGCGAGGCAGCAGGCCGATCATATGAAGCGGCTGTACAGGGAGCAGCGCAGGAACAAGTACCTACAGGAGAACTCCATACATCCGCTCGACATAGAAGCATTCGATGCCTCTATCAAGGAGCTACAAGATATGCAAAATAGGCGTCATCAAGACAACTTTATGCCTTCACAAAAAACTATAGTCCCACTAAATAGATACGACGAAGCAGAAAGAATAGTCGCTAAAGCGCTCTATACATTCAATGGTCAGACCTCGAGAGAATTGAGTTTCAGGAAAGGAGATATTATAAATGTTAGGCGACAAATAGATTCTAATTGGTACGAAGGAGAGGTGCACGGAAAAGTCGGATTATTCCCATACAATTACGTAGAATTAATGAAAGGGGATGGGATTCAAACTTTGAAGAAGACGGCGATAGTCGAGGGTCGAGCCAAAGCAAAGTTCGACTTCACCGCCCAGACCAACCTCGAGCTGCCGCTGAAGAAGGGCGAGGTCGTGGTGCTGACGAGACGGATCGATCATAACTGGTGGGAAGGAAGAACTGGCAATAAGACCGGCATCTTCCCTGACAGCTACGTCACAATACTACAAGAACCGAGTCAAAGTAGACAGGAGCCGGAAAAGCCAGTCGGTACTCCAGCGGCCCATGGTCTCATGAACGGTGACAGACCCACGTCACATCGCTACACTCCTCAACATAACAGTCCAGCTCTCTCTAACGCACCCCCCGCCACAGCGCCGCTACCGTCGCAAGGCTACATTCGCAAGTCTTCATCTACCCGCAGTGCTGACCTTAACAACACAGAGCCTCTTTACGTTGACACCAACGCTGAAGCTGTTCCTTACCGCGCCATGTACAAGTATCGTCCCCAAAACCCTGACGAGCTGGAGTTGTTGGAGGGGGAGACGGTGTACGTCCTCGAGAAGTGTGATGATGGATGGTATGTCGGCTCCAGCCAGAGAACCGGCCGGTTCGGTACCTTCCCCGGCAACTACGTAGAGCGTATATGA

Protein sequence:

>DPOGS201245-PA
MNQNRLSEPVFGKSSVSDYYYDEEINFRDLRLSGSMSRSLRALNNYVKKNIFYDESEDPENEHDSDYYSKETSPNDSVSLVKSDSESVLSEVSSVKDKINNIPTHATEDSHSKTSNDPTTLTDDYKFNKLIENASKTETPLVIENSNDINNSKELKEPPVPSTRHSLSTPLIDESEAKEKDVDLEKVTLRTKNISRDNRHTVHDVSEWVNRTDIYPDVYAPLPYKSPNRRYIESDVNIHYRCPVRHDPLPLVPERELARQQADHMKRLYREQRRNKYLQENSIHPLDIEAFDASIKELQDMQNRRHQDNFMPSQKTIVPLNRYDEAERIVAKALYTFNGQTSRELSFRKGDIINVRRQIDSNWYEGEVHGKVGLFPYNYVELMKGDGIQTLKKTAIVEGRAKAKFDFTAQTNLELPLKKGEVVVLTRRIDHNWWEGRTGNKTGIFPDSYVTILQEPSQSRQEPEKPVGTPAAHGLMNGDRPTSHRYTPQHNSPALSNAPPATAPLPSQGYIRKSSSTRSADLNNTEPLYVDTNAEAVPYRAMYKYRPQNPDELELLEGETVYVLEKCDDGWYVGSSQRTGRFGTFPGNYVERI-
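
The protein sequence above should be the conceptual translation of the nucleotide sequence. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) and using "-" for the stop codon, as the record does. The function name `translate` and the "X" placeholder for unrecognized codons are illustrative choices, not part of the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1 using the standard code.

    Stop codons are written as stop_symbol; codons containing characters
    other than T, C, A, G are written as "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 24 and last 12 nucleotides of DPOGS201245-TA.
print(translate("ATGAATCAAAACCGCTTAAGCGAA"))  # MNQNRLSE
print(translate("GAGCGTATATGA"))              # ERI-
```

Both fragments match the start (MNQNRLSE) and the end (ERI-) of DPOGS201245-PA. Running the function on the full 1782 bp sequence would be expected to yield 593 residues followed by the stop symbol.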