Monarch geneset OGS2.0

DPOGS206990
TranscriptDPOGS206990-TA933 bp
ProteinDPOGS206990-PA310 aa
Genomic positionDPSCF300001 + 591168-595653
RNAseq coverage49x (Rank: top 70%)
Annotation
HeliconiusHMEL0110665e-11460.32% 
BombyxBGIBMGA012887-TA8e-9551.62% 
DrosophilaCG10300-PA7e-2931.83% 
EBI UniRef50UniRef50_D6X3G04e-3430.66%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X3G0_TRICA
NCBI RefSeqXP_966309.18e-3530.66%PREDICTED: similar to AGAP004762-PA isoform 1 [Tribolium castaneum]
NCBI nr blastpgi|910914141e-3330.66%PREDICTED: similar to AGAP004762-PA isoform 1 [Tribolium castaneum]
NCBI nr blastxgi|910914141e-3329.47%PREDICTED: similar to AGAP004762-PA isoform 1 [Tribolium castaneum]
Group
Gene OntologyGO:00068101.3e-05transport
GO:00056221.3e-05intracellular
GO:00052151.3e-05transporter activity
KEGG pathway 
InterPro domain[100-282] IPR0012518.6e-29Cellular retinaldehyde-binding/triple function, C-terminal
[2-87] IPR0110741.1e-06Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL19374 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206990-TA
ATGCATCTTAAAGATCAACATCCCCTTTATCCGTGTTCGGAAAGCGATAAGCAAGAAATAAGAAGAGAACTCGGAATGAAAGAAAGCATTCTGCAAGAAGATATTGATGCCATACTGGATTGGTTCAACAAACAAGCACATTTGGTTCACGCTCCCATTGATCGAGACCATATTGAAAAATTGCTAATTTCTACAAACGGATCTCGTGAAAAGACAAAGAGAAGAATCGATAACTTCTACAAGTACAGGTCGCAGGCGCCTGAACTGGTGTTGTCGAGGAGAGAAGTGCTGACAAATCCATTATACAACGGCTGGTCATTTTACCACCAAGCTGCAATGTTCGAGTTATACGAAAAAAAGAGGATATCCATATTTAAAATCACAGACCCAGATCCAAGCAAATTCGATGCAGATGTCATCTTCAGGAACACTATTATGCTGGGTGATTTGAGATTAAAATTTGACTATATGCTGGGGGAGATTTGGATTATGGACCTGGAGAATGTGTCCTTTGGACACATATTGAGAGTTAATCCAACAACAATTCAGAAATTTATGAATATTATACAGGACGGCGTAGGTTTTAAAATGTTTGAAATACATTTTATCAACGTATCAAGTTTCGGTCAGCATGTTGTTAACTTCCTTAAGCAATTTGTAAAGCCTAAGATAATGGAACGCTTTGTATGTCATGAAAATAGTGAAAATCTTCATAAGTATATACCGAAAAAATACTTACCGAAAGACTATGGAGGCGAGCAACCGTCGTTAAATGATATGAAAGCTATTCTCAGAAAGGAATTACTTAAAGACCTGTCAAAGAATTATCTTTTGGAGTGTTGCCAGCAGCTTTCCGACGAAAGCAAAAGAATGGGAACGAAATACCAGGAGGAACATTTAGTCGGATCATTCAAGAAACTAGAATTTGATTGA

Protein sequence:

>DPOGS206990-PA
MHLKDQHPLYPCSESDKQEIRRELGMKESILQEDIDAILDWFNKQAHLVHAPIDRDHIEKLLISTNGSREKTKRRIDNFYKYRSQAPELVLSRREVLTNPLYNGWSFYHQAAMFELYEKKRISIFKITDPDPSKFDADVIFRNTIMLGDLRLKFDYMLGEIWIMDLENVSFGHILRVNPTTIQKFMNIIQDGVGFKMFEIHFINVSSFGQHVVNFLKQFVKPKIMERFVCHENSENLHKYIPKKYLPKDYGGEQPSLNDMKAILRKELLKDLSKNYLLECCQQLSDESKRMGTKYQEEHLVGSFKKLEFD-