Monarch geneset OGS2.0

DPOGS206932
TranscriptDPOGS206932-TA4329 bp
ProteinDPOGS206932-PA1442 aa
Genomic positionDPSCF300001 - 1059387-1070465
RNAseq coverage32x (Rank: top 75%)
Annotation
HeliconiusHMEL0143810.090.93% 
BombyxBGIBMGA012908-TA0.070.69% 
DrosophilaCG43073-PF0.064.14% 
EBI UniRef50UniRef50_E2ADJ60.048.73%Peripheral-type benzodiazepine receptor-associated protein 1 n=5 Tax=Eumetazoa RepID=E2ADJ6_CAMFO
NCBI RefSeqXP_001843285.10.054.07%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3479705040.056.76%AGAP003735-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1892386480.048.86%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00055153e-17protein binding
KEGG pathwayxla:4458511e-06 
 K13738 (CD2AP)maps-> Bacterial invasion of epithelial cells
InterPro domain[1088-1149] IPR0115112.1e-19Variant SH3
[1160-1253] IPR0014523e-17Src homology-3 domain
[663-731] IPR0089574e-07Fibronectin type III domain
Orthology groupMCL12564 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206932-TA
ATGTCGAGCGGTGGGGACTCTGACACGCTGTCTCGAAGGCTGCGTGAGGCGGAGCGCGCTCGAGCTGATGCTGAGAGAGCGCACGCGGACGCACTTGCACATCTTCGGGCGGCGCAGCGCACGCCGCACGACTCACACAACGTAGAACAGTTACAATCTAGAGTTAGAGAACTGGAGAAAAAGGCGGCATTAGAAACGGTTCGTTGCGAAGAACTGCAGTTAGAGCTGTCGGCAGCGATGCGTGCTCGTGGCACCACGACTACGGCAACCTGGTCGACGCCTGTGACTCAGCCAGCTTCCGAAATCGAGAGGATTATGGCTAAAATTGAACAAGATAACCGGATTTTAGCTGAATTGGAGCATTCTAGATCTACAACTCATGGGATTCAAACAAGCGGATCTAATCAAGCGTTGGGAGAGTTTCAGTCGCCGACACCATCTCCTTTGCCTCACTCATATACAAGCACCTCTGGCCACGTGGTGCTCTGTCAGAGCAACACGATGCCTACTCTCTCTTCTCTACACAACACTTCCCACTTCACCGCTCCCAGCTCTAATACAGTGGCGTATTCTATGAGTGCTCATAATCCTACTTCTTATTCTAATCCAGTCCCAGCTACTTACGGTCTGTCTAACGCTCATCAAGTTACTTTCACGAATCCAGTAACTGCTCCTCTCGTTAACAATATAAGTCAAATTTCGAGCACTCTTGCCGGTCTTCAATCTAATTTACAAAACATCACTGGCAATATGTCTGGGATGAATTTGGGTAGCACATTAGTCGGCCCAGCGATTAGTGGTACCTTAACAGGAGCAGGACTTTCTAGCTGTCTTAATAACCAACAAACAAATTTATCTGGAGGTCTTACAAGCACTCTTGCTGGAATTCAATCAAATTTGAGCGGTTCATTAGGAAACCCACTATCTAATCTCACAGGAACTGGACAAATCGGAAATCTTAATACAAGTTTGACTGGCACAAATGCATATGGAACGGGCGTAGGGACCTACACGAATCCATTATTATCCGGTGGTCTAGGAACAAACACAATGAATATTAAACTAAAACCCATTGATGAAATAGACCTTGGCTTAAGCAGATATGGTACTAATCCACCTTTAGTTAGAGCTACCACACCAGTAACACCGCACCAACCCTCTTGGAATTTAGGAATAGATAGCCAATTTGGAACTGATAGAATAGGAAATATGGATACCTTTATTCATGATCGAGGTCCGAGAGCCATAAGACTTTTACCAAATAATATGTTGGACGTAGATGTCCGAAACACGCATTACATGAATGGTGGCGCATCCTCTGAGCCACAAGTTGATATGCTAGACATTCCAGGAAAAGGTCGTTGCTGTGTTTATATTGCTCGATTCTCCTATGATCCGCCAGATGTCGAGACCTCGGAAGGAGAATTATCTATTTGCACCGGAGACTATTTACTGGTTTGGGGTCCACCGGATACCAATGCAACAATGCTTGATGCTGAGCTGTTAGACGGTCGTCGAGGTCTTGTTCCAGCAAATTTTGTACAACGTTTAGTTGGTGACGATCTCTTAGAATTTCATCAAGCCGTAATAGCTACTTTGCGAGATGCTGATGAAACTGCTACTACAGCAATAAGTGACATGTCGCTTAGCAGGGATGTGGCTCGTCTCAGTGAAATGGCTGATATAAGCGATGGTCCAGAAGATGATGACAATGTGCCAGCCCCGCGTCAGCTGACCCTGGAGCGGCAACTCAACAAGTCCGTGCTCATCGGCTGGACGGCGCCGGAGGGTGTGCAACAAGCCAACATCGACAGCTACCATGTGTACGTTGACGGCGTGCTCAAAACTACAGTCAAGGCAACCGAGCGAACTCGTGCTTTGGTCGAAGGAGTAGATGCGAATCGACCACATCGCATTAGCGTCCGATCAGTAACAGTCACTAGACGCACGTCGAGGGACGCCGCATGTACCATGGTCATAGGCAGAGACACTGGAAATATGGGACCCACTTGTGTTAGAGCGAGTGGAGTAACTTGCTCCCAAGCTGTAATTTCTTGGCTTCCAGCTAACTCAAACCACCAACATGTCGTGTGCGTCAATAATGTTGAGGTACGAACTGTCAAGCCGGGTGTTTATCGACATACAATAACGGGTTTGTCCCCAAGCACACAATATCGAGTTACGGTCCGAGCTAAGCATTTACGAGCAGGTCCACCAGGAGTACCTATTGAAGAAGCACCTGGAGCTTACACTGATTTTAGAACATTACCTAAAGGGCTGCCGGATCCACCAAACGAAATAATGGTGGAGGCCGGGCCACAGGATGGCACTCTCCTCGTGACGTGGCAACCTGTGCTGCGACCACCCGCCTCCGGGCCCGTCACCGGCTATGCGGTATACGCTGACGGCAAGAAGGTCACGGACGTCGATTCCCCCACCGGAGATCATGCTCTCATCGACATCGGCAAATTGATCGGCCTCAATCCCAAATGCGTTACTGTGAGAACCAAGTCGCGAGACAGTCAATCAAGTGACAGTGCTCCAACACCGATACCACCAGCTGTGCTTCGAGGAGTGGTATCTAGAGTGCCACGTGGTTCAGGTCAAGGGCCGACTTCAGGTCCAAATCAGCCAACTCCCGGTTTCAGAAACCAAAGACCTCAGCCATATCAACAACATCAGCAAGTTATCGAACACGACGAAAATCTGTCGGATAAGGAAATTTTTCCAACTACAAATCGCCATGAGGGTCAAGCAGCTCAGCCTACGAGCGGATTCGGCACAGGCCTTCTAAAGAGTATATTCGACAAAATAACCCCTTCGCAATCCGGGATACCGGCCATCGAAATCACTAAAGAGGGAGCCGTGGAGGCCAGTAGCGAGGACGAGGAACCACGCCGCCGCCCGCAGCAGCAGCCGTCCCAACCTGCTCAAGCGTCGCAACCGCCTTCGGGTCAGCAGTACCCGAATACACAAGCCTACCATGGCACGCAGCAACCCCCCTATCAACAGGGAGGCCAGTTCCCAAGTAATCAGCCAAGCCAATATCCCGGCCAACCGCAGCAGTATCAGAACCCTCCACCCCGAAATCATCATGAAAGAGGGCGCCCTCCAAATCCTCACCAGCAATTACAACAACAACAAATGCAACAAGGCCCTATGCAGCAATATGGTCCACGAGGAGGTCCTCCAATGGCTCCTAGAGGCCCCCAACCTGGTCATCGTCCTCCACAAGGACACCCTCAATCTAAAAGAAGTAGATTTTTCATCGCTCTTTTTGACTATGATCCGGCTACTATGAGTCCAAATCCCGAAAGCTGCGAAGAAGAATTGCCCTTCAGCGAAGGAGACACGATAAAGGTTTGGGGTGACAAGGACGCAGATGGATTTTATTGGGGTGAATGTCGTGGCCGACGGGGCTATGTACCTCACAATATGGTAGTAGAAGTTTCTGAACAGGAAGCCACCGGTCAGGCCGCAGCCAAGCCTCGGGATCGTTGGACGGAAGCTTACGCTAACCAGCCCGTGCGAAGAATGGTTGCCTTGTACGACTATGATCCTCAGGAGCTCAGCCCTAATGTCGATGCTGATGCGGAGTTGAGTTTTCAAACGGGTCAAATTATCCACGTTTATGGAGAAATGGACGACGACGGATTCTATATGGCAGAGATCGACGGTGTCAAGGGTCTAGTGCCCAGCAACTTCCTTACTGATGCTAATGACAATTATGGACCACCGGGACAAAGTAGCCAAGGTATGATGGGCGGCGGCAGAACCGGTATTCCGGGTACAAGTATGCCTGTCGGGGGACAAGGCCGTGGGCGAGGGGTACCACCCGGGCCTGGAGCTCGCGGTCCTCCTCCTCCACCACGCGAAAACGTAGCCCCTGCACCCAGAAACCATCATCGCAAAACAGATGCCTGCCCTCTTCCCCCTTCTCAGTTAGACCACAACACATGCGCCAGTATTCCAGAACAGGCGAATTCTCAGGGGAGGGGGAGAGGCGCGGTGAGCGCTCCTCAAGTGAGCACGATCGCGCGCACGAGCGCTGGTAATGTGAGCACGCCGACCGTGCAGTCTCCAGGATCGGCCGCCATTACCGGCATGCCGCCCATCGGCACAACGACTGCCGTCAGCACGTCGCCCGCCCGTCGAGGCGGGCCTGCTGTCGTGCCAGCACCGACCGCTCAGCCAACAACACAAGCCCCGCCTCCGACCACTCAGCCCAACCTCATGCAGAAGTTCTCCGAGATGACTGCCCCCGGTGGTGACATTCTCAGCAAAGGGAAGGAGCTCATCTTCATGAAGTTCGGACTCGGTGGAAAATAA

Protein sequence:

>DPOGS206932-PA
MSSGGDSDTLSRRLREAERARADAERAHADALAHLRAAQRTPHDSHNVEQLQSRVRELEKKAALETVRCEELQLELSAAMRARGTTTTATWSTPVTQPASEIERIMAKIEQDNRILAELEHSRSTTHGIQTSGSNQALGEFQSPTPSPLPHSYTSTSGHVVLCQSNTMPTLSSLHNTSHFTAPSSNTVAYSMSAHNPTSYSNPVPATYGLSNAHQVTFTNPVTAPLVNNISQISSTLAGLQSNLQNITGNMSGMNLGSTLVGPAISGTLTGAGLSSCLNNQQTNLSGGLTSTLAGIQSNLSGSLGNPLSNLTGTGQIGNLNTSLTGTNAYGTGVGTYTNPLLSGGLGTNTMNIKLKPIDEIDLGLSRYGTNPPLVRATTPVTPHQPSWNLGIDSQFGTDRIGNMDTFIHDRGPRAIRLLPNNMLDVDVRNTHYMNGGASSEPQVDMLDIPGKGRCCVYIARFSYDPPDVETSEGELSICTGDYLLVWGPPDTNATMLDAELLDGRRGLVPANFVQRLVGDDLLEFHQAVIATLRDADETATTAISDMSLSRDVARLSEMADISDGPEDDDNVPAPRQLTLERQLNKSVLIGWTAPEGVQQANIDSYHVYVDGVLKTTVKATERTRALVEGVDANRPHRISVRSVTVTRRTSRDAACTMVIGRDTGNMGPTCVRASGVTCSQAVISWLPANSNHQHVVCVNNVEVRTVKPGVYRHTITGLSPSTQYRVTVRAKHLRAGPPGVPIEEAPGAYTDFRTLPKGLPDPPNEIMVEAGPQDGTLLVTWQPVLRPPASGPVTGYAVYADGKKVTDVDSPTGDHALIDIGKLIGLNPKCVTVRTKSRDSQSSDSAPTPIPPAVLRGVVSRVPRGSGQGPTSGPNQPTPGFRNQRPQPYQQHQQVIEHDENLSDKEIFPTTNRHEGQAAQPTSGFGTGLLKSIFDKITPSQSGIPAIEITKEGAVEASSEDEEPRRRPQQQPSQPAQASQPPSGQQYPNTQAYHGTQQPPYQQGGQFPSNQPSQYPGQPQQYQNPPPRNHHERGRPPNPHQQLQQQQMQQGPMQQYGPRGGPPMAPRGPQPGHRPPQGHPQSKRSRFFIALFDYDPATMSPNPESCEEELPFSEGDTIKVWGDKDADGFYWGECRGRRGYVPHNMVVEVSEQEATGQAAAKPRDRWTEAYANQPVRRMVALYDYDPQELSPNVDADAELSFQTGQIIHVYGEMDDDGFYMAEIDGVKGLVPSNFLTDANDNYGPPGQSSQGMMGGGRTGIPGTSMPVGGQGRGRGVPPGPGARGPPPPPRENVAPAPRNHHRKTDACPLPPSQLDHNTCASIPEQANSQGRGRGAVSAPQVSTIARTSAGNVSTPTVQSPGSAAITGMPPIGTTTAVSTSPARRGGPAVVPAPTAQPTTQAPPPTTQPNLMQKFSEMTAPGGDILSKGKELIFMKFGLGGK-