Genomic Position | scaffold108:- 92070-100770 |
---|---|
See gene structure | |
CDS Length | 3807 |
Paired RNAseq reads   | 683 |
Single RNAseq reads   | 3215 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004903 (0.0) |
Best Drosophila hit   | CG2698, isoform A (2e-28) |
Best Human hit | transmembrane protein C9orf5 (5e-27) |
Best NR hit (blastp)   | PREDICTED: similar to CG2698-PA [Apis mellifera] (3e-90) |
Best NR hit (blastx)   | PREDICTED: similar to CG2698-PA [Apis mellifera] (3e-93) |
GeneOntology terms    | GO:0008150 biological_process GO:0003674 molecular_function GO:0016020 membrane GO:0016021 integral to membrane |
InterPro families   | IPR002549 Uncharacterised protein family UPF0118 |
Orthology group | MCL11974 |
Nucleotide sequence:
ATGCAGCAATCACCGTTTGAAAATATTTTTAATATTATTGGTGGCCTATCAGAGGGTAAT
GAGAAACCTATGAAACATGGTTTCTACAATGCTTTCGCCCTTTTCATCCTAACGATTTGT
TGCACAGCAGTTTATGTGCTATTTTTAATTTTGGAACCTTTTTTTAAACCATTGTTTTGG
GCTCTATTAGTGGGTTCTTTATTACATCCATTTAAATACAAATTATCAAAAAAAATTAAG
ACATGGTTCGAAGAACTGGAGAAATCGAATTCATCTGTGATATATGGTTTAATGGTGATA
CCGGTGGATGTTATAAATTATGCTTCCGATACTGTTGGTTATCAATTCAGAGATCATTAT
AAGACAATGACTTTTGCTTTGGCATATATGATCAGCATATATTTCCTGTGGAGTCCGTCT
AGATCGGGACTATTCAAAGTTACCTCTCATATTCTCTGGCTCATCATTGCTAGTTTCTTG
GCAAACTTGGCCGGCTCTTTCAAAGTTTATGTGTTCGTACTGTTACAAGCTATTGGTATA
ACTGGCTTTGTACTAGAAGTCCTTCACATACATAAGCATTTGTCACAAAAAGATCCTGAA
GCATCATTGTTTACTGCATTGCAACATGCTTTAAGCAACAGACCACTAGAACATACTGTC
CTTGAAAAGATTGATGAGAACCCTGAAGAATCTAGCCCTGAGGATGTGAGTGACAAAGAA
AAAACTCCTGATATATCGCCAGTGACCCCAGAGGTTGCGAAAAGGGGGAGCTGGTCAGAA
AATGATCAATTAAGAAGCTCCAGACAGGCCAAAATGCTATTTAAGACAAGAACCCTCTCA
GCACTTCCATTACAAAATGTTAAAACGAAATCTGCCTTCGAATCTCGACTTCTAGCCTCA
TTCAAACAGAGGTCCTTGGATGAGAATTATATACGAAATAATTTCAATGATGACGAGACC
GCGCTATACTTCAAACTTCTATTTTTTGGTTGCTTTTGTATGTTGATATGGAAATACATA
TGGCTACTGCCAGTCATGCTGTTCTTCCTAGCCATCCACATCATAAAGATATCGCTCAAC
TTCTTTGGGGTTTGGTTATTTTTTGAGAATCAATGCAACAACCTAACAGCTAAAATTGGT
GGATGGTGGAATGACAGAAAATCTGCTGTCATCCCGGCACAGATACGAGGCATCGGGAAG
ATGTTGTCTCTGTGCAATAAGAGTGTTATCGAAATGGCATACGAGTCAATCGATACAGTG
TCCACATATAACGCATACACATATGGTAGAGAAGGAATATCTACCGGGGTGAAGAGCATC
CTAAAAGAGGGTGATCCAGAGAAGATAGCTCGTGTGGAACAACAGGTCCTGGAAATATGG
GATCGCGTGTATCAGAGCTGGGTGTCAGGAATGTTCGCTCAGAGTGGACCCCAGATGGAT
GGGTCCGCGGTACAAGAGTCCTGGGATAACTTTGTGCTAGAAGCTGGAACCTCTGGAGGT
GTGTTGGACTACACGGGTCTTGTGTCGTGGGTCAAAGCCAATGTGGGCACGTTGAGCGCG
ATCCTGTCCGGAGTTTGGGCGCCCCTGGCCAGCAACGTCTCACTGTTGGCTGGTTCCCTT
GGCGCCTTCACATCCCTACTGCTCTGTGGCGGGGGCGCTATAATCAATATGTTTATTAAT
ATGGTGGTGTTCTTCACGACTCTGTTCTATCTGCTGGCATCCAGCAACGCTCTCTACAAG
CCGGTTGAGGTGATAACACGTGTTCAACCGAACTTCGGCCCCAGGCTGGGAATAGCGCTC
TCTGTGGCCATTAATCAGGTATTCCGCGCCTCGTTCAAGATGGCGTTGTTTTACGGTCTC
TGGACGTGGCTAGTTCACAACTTGTTCGGAGCTAAAGTTGTTTACTTGCCCTTAACTAGT
GGGCTTAGGATCAACTTCACAAACTGCCAACACCCTGAATGGACGTGCCATAACAATGCA
CCGAGTAACATCACTTACGTGTGTCCGGCTAGGAGTAATGATAACATAGTTCATCTCAAA
GATTTCTACACGGATCACATCCACAGTTTGACGATAAAGAACTGTAGGGATTTAAAGGTG
GTTTTGGACTGCCCTATCCTACAAAGGTCATCCCGACTTCAAAAACTGGCTATAAAGGAT
TGCACGAGACTAGAATTCATATCCCTATCGAGTTCGTCCCTGCTACAAACACCACCAGAA
GTTTACATTGAGAATATTAAAGAAATCATATCTCTGCCCAGAAATATGTTCAAATCACCA
AAAACTATCACAGAGACTAAATGCATGGGGACGTCATCTTTGAAATGCATTCATTTTGCA
AATTCCAAAATAAATACAATAAGTTCAAAGGCCATTCACAACATAAGCGGCTGTAGAAGT
GTTGAGTTCGACAATGTCACTATCAATAGCATCCAAAGCGAAGGCATAGAGGCTATCTTG
GGGAACGATAATACGCTATTTGAAATGACCAATACTAAAGTTGATGGTATTGATTACAAA
GGTATAACAGTACAAAGTACAACAGCTAGGCTATCGTTTAATAGCTTCGGTGAGGTCGTA
GCGGGTGTGGTGAATGTTACGTCCGATAAATTATATATAGTTGGTAATTCTTTTAGAACA
ATACACGGCAATGGATTGGTGACGAACTCGATACACACAGATATAACGGACAACATAATA
GGAACGCTGAAAACTAACGCGCTTAGCAACATCAAATGTTCCAAACGTCGTTCAACAAAG
AAACACATGAATTTCATTAGGAACACCATCGAGCGGGTTGAACCTTACTCGCTGTACCTG
GACTATGCCAGTTGCAAAACGGCTGTGACGCAGGTCGTTATACGAGAAAATAAAATGGAC
TGCAAGTGTCGGAACATAGCGTATCTAAACTCACAGACGAATGTTGAGCAAAACAACTTA
ATCTTGAATCTGAGCAACAACAATACGTGTCTGATGACGTCATGCGTGCTGCCGGTCGAA
GTTGTGAAGCTGTTATTAGAGAGCGACATGTGCCACTTGAATCTCGACACACGAGTGATG
TGTCTGCTTTATAACGATAAGCATTCCAATAAAAACGAGGTGATTACGGACGAGGACGTC
ACAGAGCCGGCACCCACGTTTTACTTGATAAGACAAGCTAATAATCTACAAGAGGGTTCG
GCGGCTATGACCGCCATCGATAAGGAGGATCTGTTGAAAGACACGCATTTAAACATGACG
AATAGAACCACGATCAAAGTCGTGTTCGATTCCTCTAAAGATTTTGTGGAAACTCTCCGA
AGTACGAGTAGGACCAGAAACAGGCCTACAGAGCACAGTCCGCCCAAAGATGAATACGTG
TCCCGATGCATAGGAACTCAGTGCAGGAACACCGCCGCCTACAATAGACAGAGGGCGCTC
GATTTTTACAAATACGTATACGCACAGTTACTTCTAGCAGCAGTGTTAGGCGCAGCTCCA
TTCTTAGGGCCATACCTGGCTGGTATCCCCGCAGCATTGGATGTCTGGCTCCAAGGTCGA
CCGATGGCAGCCCTCCTGTTACCAATCGTACAAGCAGCTCCTATAGCCTTTTTGGATGCT
GCGGTTTACGCAGAAATTAAAGACGGTGGTCACCCTTACGTGACGGGTCTTGCGATAGCT
GGCGGTATATTCTACCTAGGGCCGGAGGGCGCCATACTTGGGCCGTTGTTGTTATGCTGT
CTGATGGTAGTGTTCAATCTGTCCTCGACCTTCCTCAGGGACACTCCCTCGGAAGAAAGA
GCCGCCTTACATTCGCGTGTCAGGTAA
Protein sequence:
MQQSPFENIFNIIGGLSEGNEKPMKHGFYNAFALFILTICCTAVYVLFLILEPFFKPLFW
ALLVGSLLHPFKYKLSKKIKTWFEELEKSNSSVIYGLMVIPVDVINYASDTVGYQFRDHY
KTMTFALAYMISIYFLWSPSRSGLFKVTSHILWLIIASFLANLAGSFKVYVFVLLQAIGI
TGFVLEVLHIHKHLSQKDPEASLFTALQHALSNRPLEHTVLEKIDENPEESSPEDVSDKE
KTPDISPVTPEVAKRGSWSENDQLRSSRQAKMLFKTRTLSALPLQNVKTKSAFESRLLAS
FKQRSLDENYIRNNFNDDETALYFKLLFFGCFCMLIWKYIWLLPVMLFFLAIHIIKISLN
FFGVWLFFENQCNNLTAKIGGWWNDRKSAVIPAQIRGIGKMLSLCNKSVIEMAYESIDTV
STYNAYTYGREGISTGVKSILKEGDPEKIARVEQQVLEIWDRVYQSWVSGMFAQSGPQMD
GSAVQESWDNFVLEAGTSGGVLDYTGLVSWVKANVGTLSAILSGVWAPLASNVSLLAGSL
GAFTSLLLCGGGAIINMFINMVVFFTTLFYLLASSNALYKPVEVITRVQPNFGPRLGIAL
SVAINQVFRASFKMALFYGLWTWLVHNLFGAKVVYLPLTSGLRINFTNCQHPEWTCHNNA
PSNITYVCPARSNDNIVHLKDFYTDHIHSLTIKNCRDLKVVLDCPILQRSSRLQKLAIKD
CTRLEFISLSSSSLLQTPPEVYIENIKEIISLPRNMFKSPKTITETKCMGTSSLKCIHFA
NSKINTISSKAIHNISGCRSVEFDNVTINSIQSEGIEAILGNDNTLFEMTNTKVDGIDYK
GITVQSTTARLSFNSFGEVVAGVVNVTSDKLYIVGNSFRTIHGNGLVTNSIHTDITDNII
GTLKTNALSNIKCSKRRSTKKHMNFIRNTIERVEPYSLYLDYASCKTAVTQVVIRENKMD
CKCRNIAYLNSQTNVEQNNLILNLSNNNTCLMTSCVLPVEVVKLLLESDMCHLNLDTRVM
CLLYNDKHSNKNEVITDEDVTEPAPTFYLIRQANNLQEGSAAMTAIDKEDLLKDTHLNMT
NRTTIKVVFDSSKDFVETLRSTSRTRNRPTEHSPPKDEYVSRCIGTQCRNTAAYNRQRAL
DFYKYVYAQLLLAAVLGAAPFLGPYLAGIPAALDVWLQGRPMAALLLPIVQAAPIAFLDA
AVYAEIKDGGHPYVTGLAIAGGIFYLGPEGAILGPLLLCCLMVVFNLSSTFLRDTPSEER
AALHSRVR