Monarch geneset OGS2.0

DPOGS207774
TranscriptDPOGS207774-TA1677 bp
ProteinDPOGS207774-PA558 aa
Genomic positionDPSCF300042 - 160852-162528
RNAseq coverage164x (Rank: top 51%)
Annotation
HeliconiusHMEL0175770.078.71% 
BombyxBGIBMGA005329-TA0.074.06% 
DrosophilaCG3149-PA4e-15854.11% 
EBI UniRef50UniRef50_Q9Y1236e-15654.11%Protein RFT1 homolog n=14 Tax=Endopterygota RepID=RFT1_DROME
NCBI RefSeqXP_975124.15e-17154.15%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910897379e-17054.15%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|910897371e-16953.97%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00053195.1e-183lipid transporter activity
GO:00068695.1e-183lipid transport
GO:00160215.1e-183integral to membrane
KEGG pathwaytca:6640071e-170 
 K06316 (RFT1)maps-> N-Glycan biosynthesis
InterPro domain[1-554] IPR0075945.1e-183RFT1
Orthology groupMCL14998 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207774-TA
ATGGGTCGCAATTTATTGATGAGTAGCCTCGAAAATGCATCCTTTAATATACTCCTGCAAATATTGTTTAGATGCGTAACTTTCATAATTAATGCATGGGTTATTAGAAATGTTGGTCATGAAGTTATTGGTATTATGAATGTCAGGTTATTGTTGCTTGAAAGTACTATATTATTCCTAAGCCGAGAACCCTTCCACCGCGCTTGCTTGGGTCAAAAGGGTGAGTTTAATTGGAATCAGGTCATTAACCAGATTTGGTTATCAGTGCCTTTAAGTTGTGTTTTATCTTCAATTTTCATTTATATTTGGTTAAATATTCTACCACTTGGCCATCCAGAACATTCTTCTCAATATACCTTTGGCTGCTGGAGTGTAGCTTTCTCATGTGTATTGGAACTTTGTTCTGCCAATATGATGCTTGTGTCCCAACTCTATTGTTTTGTCAAGTTAAAAATTATTTTGGACACTTTGCACATATTTATCAGAACTATTATATTTATTTCTATAATCGTTTATGATAGGTCTGCTGCTTTAATTGCATTTTCGGTAGCCCAAGTCGTTAGTATTGCTGCTATAGTTGTATCTTATTACATATTTTTCTATTGGTACATAAAATGTAAACCGTTATATGCAAAAGGTGCTCTGAAGACTCGGTTTCTGTCTGCTAAAACTCTGGATACTCTTTTCAGTGACATGGATGATTTTAATTTTATATCTCTGAGAGATTTCTTTCCAAAATATTTGGGTTCAATAAATTCATGTTTTAATAAAAAATTAAACACTCTAACATTAAGTTTCGCTAAACAGGGAGTAGTTAAACAACTGCTGACCGAGGGTGAGAAATATGTGATGTCTGCAAGTCCTGTGATGACATTTAGTGAACAAGCCACTTATGATGTTGTTAATAACTTAGGAAGTCTTGCTGCAAGATTTGTATTTCGACCAATCGAAGATAGCAGTTACTTTTATTTCACACAAATGGTTAGTCGTGATCTTCCCTTGTATAAGCAAGATCGGAACAAAATCCACGAATCTTGTACAGTGTTATATCAAGTTTGTAAAACTGTTAGTTCTATAGGTTTAATTGTATTGGTTTTTGGACTCAGTTATTCTTCTACATTACTAACTTTGTATGGAGGGGAAGCGTTTGTAGCCAGTGGATTACCAGTTACTTTACTTCAAAGCCATTGTTTCGCTATTGTGCTAATGGCTGTCAATGGCATAACAGAATGTTACACGTTTGCTACAATGACTAGTGCCCAATTGAATAGTTACAACTATCTAATGGTATTCTTCTCAATAAGTTTCCTGATACTGTCATATGTATTGACATACGTTTTTGGTCCAGTTGGTTTTATTATATCTAATTGTATAAATATGTTCGCAAGGATTTTGCATAGTGTACATTTTATTAACGATAAACATAAGGATACAGATCATAGACCTTTGCACGGTCTGTACGTCGGAAAATTATTTCTATTTACATTGTTTTTGGCTGGTTGTATCTGCAAAGCATCTGAACATAATCTTTCTAAAAATATGTTAACCCATATAGCAATTGGAATGGTATGCCTATTTTTTGTACTGTTATCTTGGAGTGTAGAAAATAAAGATCTCTTAAAGAAAATATACGCAAAATTCACAAGGACAGAAGAAAACAAAGTTTCAACAGACTAA

Protein sequence:

>DPOGS207774-PA
MGRNLLMSSLENASFNILLQILFRCVTFIINAWVIRNVGHEVIGIMNVRLLLLESTILFLSREPFHRACLGQKGEFNWNQVINQIWLSVPLSCVLSSIFIYIWLNILPLGHPEHSSQYTFGCWSVAFSCVLELCSANMMLVSQLYCFVKLKIILDTLHIFIRTIIFISIIVYDRSAALIAFSVAQVVSIAAIVVSYYIFFYWYIKCKPLYAKGALKTRFLSAKTLDTLFSDMDDFNFISLRDFFPKYLGSINSCFNKKLNTLTLSFAKQGVVKQLLTEGEKYVMSASPVMTFSEQATYDVVNNLGSLAARFVFRPIEDSSYFYFTQMVSRDLPLYKQDRNKIHESCTVLYQVCKTVSSIGLIVLVFGLSYSSTLLTLYGGEAFVASGLPVTLLQSHCFAIVLMAVNGITECYTFATMTSAQLNSYNYLMVFFSISFLILSYVLTYVFGPVGFIISNCINMFARILHSVHFINDKHKDTDHRPLHGLYVGKLFLFTLFLAGCICKASEHNLSKNMLTHIAIGMVCLFFVLLSWSVENKDLLKKIYAKFTRTEENKVSTD-