DPGLEAN02086 in OGS1.0

New model in OGS2.0DPOGS209889 
Genomic Positionscaffold82:- 81377-84688
See gene structure
CDS Length3312
Paired RNAseq reads  1410
Single RNAseq reads  3902
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000187 (0.0)
Best Drosophila hit  gryzun, isoform B (7e-141)
Best Human hitfoie gras isoform a (0.0)
Best NR hit (blastp)  PREDICTED: similar to FLJ12716-like protein [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to FLJ12716-like protein [Tribolium castaneum] (0.0)
GeneOntology terms

  
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families
  
IPR021773 Foie gras liver health family 1
IPR012880 Domain of unknown function DUF1683, C-terminal
Orthology groupMCL13444

Nucleotide sequence:

ATGGCGACTCAGCCGAGTGACAACACAGAATTTCCGCCTGAAATCATTCTTAAGCCCCTT
GCCTTGATAGGGTTGTCAGGGCTTGATACGGTGAATAATGCAATCCACAAAGCTATATGG
GATGCCTTCTCAAACAACCGCCGACCGGACCGAGCCGCCGTGAGGTTCAAGTTGTTGAAT
AACACGTTCGAATTTCCAGTGGTGAAGCCTAAGAGAAACTCCTATGAATGGTACATCCCC
AAAGGTATATTGAAGAAAAACTGGATAACTAAACGTGTATCGTTAATTCCGGCCGTGGTG
GTTATTTTTTATGATATGGAGTGGAATGATCCTCAGTGGAACGAAAAGATCATCGAGTGT
GCGTCGAGAGTGCAGTCGATACGGGCAGCGGTGGAGGGACACGCTACACGTGTCGCTGTT
GTGGTTGTACAGAGTGGACTTTCACCCCCACCATCGGAGTACATGCTCGGTGCTGAAAGA
GCACAGGCACTTTGCTCAGCATGTGAAATACAATCTAAGTCTCTCTTTGTACTCCCTCAC
AGCGATCACCTCATGGGTTACATTATAAGACTAGAAAATGCTTTTTATGATATTGCACAA
AATTATTATCATCACGAAACCAAGAACATCAAGCAGCATAGAGATCATCTGAATAAGACT
ACCCATCAGTATTTGTTTGTTAGACATCAGTTCAAACTAGGCTTCCTCAATGAACTCAAG
CAAGACATAAGCACGGCCCACAAACACTACATGCATGCATATAACAACCTCCTTGATACC
AGACAAGTAGATACTAATGTACATGAAATACGAACCGTGTCTGGTTACATCAATTATAAA
CTATGTAAGCTGCTGTTTGCCTTAAATTTGCCACGAGATGCAATTGCACAAGTTAAGTCA
CATATAGAGCGCTACAAAAACAGAATTGGACCCACTGAACTGTTGTTTGAGCATTATGGC
TGGATTGCCAGGCAGTATAGTGCCTTTGGAGAATTATTTGATGAAGCTATAAGGTTAGGG
CTTCCGGCAATTCAATCCCAACATCCTGGCTTTTATTACCAGTATGCAGCTCAATTTACA
GTGAAAAGACGGCAAGCCATGAGGTCGGTATGCTGTGATGCTTCACACTATCCACCTGCC
CCAGACCCCATGGAGGGTATTGTGGAGTTTTATGGCCAGAGACCTTGGAGACCGGGACGA
CTCAGTGCGGATCCACATGATCCACAAAAGGAACAAGCGGCGGTGTTGGCACTGCAATAC
AATGAAAGAATTTTCAACCATTCTGCTATGATAATTAGTTTTCTAGGTAGTGCTATCTCC
CAGTTTAAAACATTTCACTCGCCCAGAATGAGGAAGCAGTTAGTGGTTGAGATGGCTAAT
GAATATTATTTTTGTGCGGACTATGGTAAAGCTTTGACTTTATTGTCTCATATGCTCTGG
GATTATAGAAAAGAAAAGTGGTGGTTTTTGGCTTCCCATGTCTTAAACCGAGCTTTACAA
TGTGCCTACTTGTCTGCAAAAATTCAAGACTACATTCATTTATCAGTGGAGGCACTCTCC
AAATACATTCAAGTGCCAAACAACGACAAAGATAGAATATTTAGAAACATAATGGCAGTT
CTCAACATGAACATTCCATCACCAGAGCCGAACCTCCCTCCTTCCTCACAGAGTAAAGCA
TTAGAAATGTGGCAACTGGCTATAGACAAAGAGCCTCTCACCATTGCCATAGATATGATA
AACATAGCCAGTTTCCTAGAAGTAAAAGCAAAGTTCAAGCAACAGAAATATAGGATGGAT
GATACAATTGAAGTTGAGTTGTTTGTTAGACTTACATATAACACAACCCTTGATGTTAAA
AGTGCCTCTATGACAATTGCAACAAATACAGAAACTATTGACATAAATATAACGGATGAA
GGCAGTACTACACTGAAACTGATCAGAGGAGAAGTTAAAAGGTTTCTGTGTCAATTTAAA
GCCAGTCCACATGATAATGGATCGGAAATGAAAATCAAAAATGTATCATTTGTATTGGAC
AGTGACAGGAGAAAAATTATAATGAACTTTAAAATCGATGAAATCAAGAATGTAGAGCCC
ACAGTCCATCCTGAATTACTACACTTCATAATGAGTCCTAAAAGTGACTATGAATTTGAT
TGTATAATGCCTTTGACCACTACATCCATCACCAGCAGGGAATGTAGACTGTCTTTAGAT
ATTAAAAATGCAGTGCCGGCTTTACAAGGCGAGTGGTTTCCCACCACTTTCACAGTAATA
AACCATGAAGACGGTCCCGTTCATGATATGTCAATAGTGCTGACACTTCTAAGCTCTCCT
GATAATCCAAACCCTGAATCGGTCACAGAGTTGGGCTTTAGACACGGTGAACCCGAAGCC
CAACCCATTAAACTCTGTGTCGGAGATGTGAATAAAAGTTCTTCATATTCAAACACATTT
TATTTAAAAACTAACAGAACAGCCACAACAACTGTTCAAATAAAAGTAACGTACACAGTA
GATGCTTATGAAACACCTCAACTTGAATGTTCCAAAGAATTCACAACGAAAATCACAGTG
ATCAAACCGTTTGATGTATCAACCAGTTTCGTGTCCATGAACTTTAAGCCTATAACGAAA
TGCTATGTAGATGATCCCTTTATAGTTATGCCTCAAATAAAAATTTTAAGTCCCTGGAAT
TTAGTTATTTTAGATACAGAACTAGAAACGGTAGAAAGCTTTAGATATGCTGATGAGAAA
AAACCTCAATCATGTATAAGTAACCTACGAGTGGCTGAGAAGAATGTGGCCTCTGATGCT
ATATGTATACAGGCTAACTACAAGCCAAAGGAAGTCGCTACGAGAGTAGGCTTGTACAAC
ATCTCTTGGCGTAGAGAGAGCAACACAGATGGCCATTGTGTTATGAGCACTACTGCCCTC
TCGGCACTTCCAATAGATGATTGCCCAATTACTGTTGAAGTCAATTATCCAGAGGTTGTT
GACCTCCAAACATCCGTGCCATTAAAATGTACTCTAATTGGGAAAACTAATACTCCTATC
AGACTGAGTCTCTCCGTGGAAGGCACAGATGCATATATGTTTTCAGGGTACAAAAAGTTC
TCCATCACTGTACCACCCAGAGATAAGGTCGAGTTATGTTACAACATTCACCCCCTGGTG
GCCGGGAACACAATCCCTCCTCGGTTAAAAGCAACAGTTCTTGGTGACACGTCTAGACAA
GAGGTTGTAAAAGAAATGTTTGACAAAATCTTTCCTCAAAATATTTTTGTTATGCCTAAA
TATAATAAATAA

Protein sequence:

MATQPSDNTEFPPEIILKPLALIGLSGLDTVNNAIHKAIWDAFSNNRRPDRAAVRFKLLN
NTFEFPVVKPKRNSYEWYIPKGILKKNWITKRVSLIPAVVVIFYDMEWNDPQWNEKIIEC
ASRVQSIRAAVEGHATRVAVVVVQSGLSPPPSEYMLGAERAQALCSACEIQSKSLFVLPH
SDHLMGYIIRLENAFYDIAQNYYHHETKNIKQHRDHLNKTTHQYLFVRHQFKLGFLNELK
QDISTAHKHYMHAYNNLLDTRQVDTNVHEIRTVSGYINYKLCKLLFALNLPRDAIAQVKS
HIERYKNRIGPTELLFEHYGWIARQYSAFGELFDEAIRLGLPAIQSQHPGFYYQYAAQFT
VKRRQAMRSVCCDASHYPPAPDPMEGIVEFYGQRPWRPGRLSADPHDPQKEQAAVLALQY
NERIFNHSAMIISFLGSAISQFKTFHSPRMRKQLVVEMANEYYFCADYGKALTLLSHMLW
DYRKEKWWFLASHVLNRALQCAYLSAKIQDYIHLSVEALSKYIQVPNNDKDRIFRNIMAV
LNMNIPSPEPNLPPSSQSKALEMWQLAIDKEPLTIAIDMINIASFLEVKAKFKQQKYRMD
DTIEVELFVRLTYNTTLDVKSASMTIATNTETIDINITDEGSTTLKLIRGEVKRFLCQFK
ASPHDNGSEMKIKNVSFVLDSDRRKIIMNFKIDEIKNVEPTVHPELLHFIMSPKSDYEFD
CIMPLTTTSITSRECRLSLDIKNAVPALQGEWFPTTFTVINHEDGPVHDMSIVLTLLSSP
DNPNPESVTELGFRHGEPEAQPIKLCVGDVNKSSSYSNTFYLKTNRTATTTVQIKVTYTV
DAYETPQLECSKEFTTKITVIKPFDVSTSFVSMNFKPITKCYVDDPFIVMPQIKILSPWN
LVILDTELETVESFRYADEKKPQSCISNLRVAEKNVASDAICIQANYKPKEVATRVGLYN
ISWRRESNTDGHCVMSTTALSALPIDDCPITVEVNYPEVVDLQTSVPLKCTLIGKTNTPI
RLSLSVEGTDAYMFSGYKKFSITVPPRDKVELCYNIHPLVAGNTIPPRLKATVLGDTSRQ
EVVKEMFDKIFPQNIFVMPKYNK