Monarch geneset OGS2.0

DPOGS208886
Transcript: DPOGS208886-TA  2226 bp
Protein: DPOGS208886-PA  741 aa
Genomic position: DPSCF300009 - 1098332-1111502
RNAseq coverage: 582x (Rank: top 22%)
Annotation
Heliconius: HMEL008900  0.0  65.97%
Bombyx: BGIBMGA002456-TA  1e-136  74.20%
Drosophila: cindr-PC  5e-35  44.10%
EBI UniRef50: UniRef50_B0WJD9  5e-83  38.93%  Dab2-interacting protein n=3 Tax=Diptera RepID=B0WJD9_CULQU
NCBI RefSeq: XP_001848823.1  9e-84  38.93%  dab2-interacting protein [Culex quinquefasciatus]
NCBI nr blastp: gi|170042202  2e-82  38.93%  dab2-interacting protein [Culex quinquefasciatus]
NCBI nr blastx: gi|170042202  1e-98  33.49%  dab2-interacting protein [Culex quinquefasciatus]
Group
Gene Ontology: GO:0005515  6.8e-23  protein binding
KEGG pathway: tgu:100225064  5e-28
  K12470 (SH3KBP1, CIN85) maps-> Endocytosis
InterPro domain: [153-264]  IPR001452  6.8e-23  Src homology-3 domain
  [161-213]  IPR011511  2.7e-13  Variant SH3
  [11-31]  IPR000108  9e-06  Neutrophil cytosol factor 2 p67phox
Orthology group: MCL16126  Insect specific

Nucleotide sequence:

>DPOGS208886-TA
ATGGAGACTAATACAAATCAAAAAGTTTCATGTATAGTGAACTACTCCTATGATGCGTCTGAGCCCGATGAGTTGACGATACGTCCTGGTGATGTGTTGAGAGATGTGGAACGTCTGCCCGGTGGCTGGTGGAGAGGTGAACTACGTGGTCGGAAGGGAATGTTCCCTGATAATTTTGTCTCTGTTCTCACTGATCAGAATAATGCAAGGCCGAGTAACGTGCAAGGTCGTTGTCGAGCGGTGTACAGTTATCAACCGGCGAACCCTGATGAGCTCCCGCTTTGTGTTGGTGACGTTTTGGAGGTCTTGAACGAGGTCGAGGAAGGCTGGTGGAAGGGTCGTAGGTCAGGTCGTGTGGGCGTATTCCCGTCCAACTTCGTTGTCATGTTAGAGACGAGCCCCACGCCCGCACCTCCCCTACACCCAGCACCGCCGCTGGAACCAGCGCCCGCACTGCCGCCAAAACCCGTCAAAGAGTTATGCCGCGTTCTGTTCCCATATACAGCGGTGAACGAAGACGAATTAACTCTATCCGAAGGTGACATAGTTAGTATCGTGTCAAAAGAAGCACCGGATAGAGGCTGGTGGAAAGGGGAACTTCACGGACGAGTAGGCTTCTTCCCGGATAATTTCGTGCAGTTGTTACCGGCAGTGGCCCAGGAGGTTGAGGAGAAAAAGCCGGATAGACCATCGTCGAAGACAAACTCACTGTATCCAGTACTGAACAAGTACTCAGAGAAAACAGCGTCTGTTAAGGAGCTTACAACATCTAAAATAAGCATGCACAAAGAGAACACGGTCAGTAAGGAGAGCAGCACTCAGAGTACCGTCACACACACAACCAAGAATGAAACGGAGAAGATTGTACACAATGAGACTGTTATAGACGGATCTACAGACACACAGGTCAAGAAACCTCCGGTGCCGTTAAAGAAGAGTCCAACACCCGGTACTGTGGGTGGTTTGTTCAGCGGACTTAAGAATAAGATGCTCTCATCCTCCGATAAAACAGAAAAAACAGTATCAAGTAGTGTAACGGCGTTCGATCGCACCGATGGTATAACGTCGAGCAAGGCGATTGTCAACGACACCAACAGTTTTGATCACGTGGAGAGGAACTCCATACTGAATGATCCTAGAGCTGGCAGGGTAAAAGCCCCTCGTCGACGTCCTCCCAGTCAGGCGCTGAGAGACGACACTCTTCAACAGATCGGTCTCAGTAATGGTCATGAAGCACCGATCGAGAAATCTCCTGAGAAGGAGGAGATCCGACCAAAAGCACGGGACTGGGAGAAACACCGAGCGCCGTGGATGGAGGAGCTCAAGTTAAACCAGGCGAGGAAGACCAGCGGGGACGGGAAGACGCGTCCATCAGAGGCTGGACTTAAATCTGGTGGCAGTACTGACAGAATTTTGGAATGTGGCTTAGAAGACAAGAGATCTTCACATTCAATGGAAATGTCTAAAAGCACGTCGTCCATTAGCAACAGAATATCGGTTTTAGAAAATATTAAATCTACCACTGAGACTAAGACACAAGTGAGTGAGTCAAAGAAAGAATCCTTACCACCTGAACTACCAACAACAAGTCCACCTTCATTGCAATCTCCAACAGCCGCTGGACGCACTATATCTGCCATGTTTGAAGAGGTTCGCAAGCCGCCAGCTCCCGCACCGCCTATAGCCGCTCGACTGTCCGATCATGAGAAATCCAAACAGGACAAGACAGACAGACACTCTGACAGGTTCTCAAGCCTAGATGGTTCGGACAAGAGTGACGTCACAGAGGTCACTGACCCCTCCAAATACCCCAGCTTGGACAAATCGACTGATAAAAATAATGAAATCAACAAACTAGCAGCTACGGCGGATAGGTTCAGTGACATTCAGATATTTGCGACTGAAAAGACTAGTGTTAAGAATGAAGCTAGTAAATTTAGTATCATTGAAAAATCGACCGCGGCCATAGAGAAAAATGAAAAGACGGAAATTCTACGTTC
CAGTACATTAGAAAGGCGGCCTAAGAATGATAAGGAGTGCTCAGAGGCTTTGATTGCTCAATTAAATAATAGGATACTTAACCTAGAAAAGTTAATAGAAGTACAAAACGCTAAATTCAACACTGCCATCGAGGATCTGTCGAATAAGTTGAAACAGGAGACGGAGAAACGACAGGCGTTACAAATGGAAATAGAGAAGCTAGCTCACTGTGTTACACAAGTCTAG

Protein sequence:

>DPOGS208886-PA
METNTNQKVSCIVNYSYDASEPDELTIRPGDVLRDVERLPGGWWRGELRGRKGMFPDNFVSVLTDQNNARPSNVQGRCRAVYSYQPANPDELPLCVGDVLEVLNEVEEGWWKGRRSGRVGVFPSNFVVMLETSPTPAPPLHPAPPLEPAPALPPKPVKELCRVLFPYTAVNEDELTLSEGDIVSIVSKEAPDRGWWKGELHGRVGFFPDNFVQLLPAVAQEVEEKKPDRPSSKTNSLYPVLNKYSEKTASVKELTTSKISMHKENTVSKESSTQSTVTHTTKNETEKIVHNETVIDGSTDTQVKKPPVPLKKSPTPGTVGGLFSGLKNKMLSSSDKTEKTVSSSVTAFDRTDGITSSKAIVNDTNSFDHVERNSILNDPRAGRVKAPRRRPPSQALRDDTLQQIGLSNGHEAPIEKSPEKEEIRPKARDWEKHRAPWMEELKLNQARKTSGDGKTRPSEAGLKSGGSTDRILECGLEDKRSSHSMEMSKSTSSISNRISVLENIKSTTETKTQVSESKKESLPPELPTTSPPSLQSPTAAGRTISAMFEEVRKPPAPAPPIAARLSDHEKSKQDKTDRHSDRFSSLDGSDKSDVTEVTDPSKYPSLDKSTDKNNEINKLAATADRFSDIQIFATEKTSVKNEASKFSIIEKSTAAIEKNEKTEILRSSTLERRPKNDKECSEALIAQLNNRILNLEKLIEVQNAKFNTAIEDLSNKLKQETEKRQALQMEIEKLAHCVTQV-