Monarch geneset OGS2.0

DPOGS201788
Transcript: DPOGS201788-TA (1344 bp)
Protein: DPOGS201788-PA (447 aa)
Genomic position: DPSCF300145 - 232367-237371
RNAseq coverage: 517x (Rank: top 24%)
Annotation
Heliconius      HMEL003557        2e-127  88.72%
Bombyx          BGIBMGA013178-TA  0.0     86.76%
Drosophila      CG3556-PA         1e-133  55.23%
EBI UniRef50    UniRef50_Q9W4K2   2e-131  55.23%  Uncharacterized protein CG3556 n=10 Tax=Diptera RepID=Y3556_DROME
NCBI RefSeq     XP_001976882.1    2e-132  55.23%  GG18709 [Drosophila erecta]
NCBI nr blastp  gi|194888232      3e-131  55.23%  GG18709 [Drosophila erecta]
NCBI nr blastx  gi|195132899      1e-130  53.98%  GI21790 [Drosophila mojavensis]
Group
Gene Ontology    GO:0015889  5.2e-13  cobalamin transport
                 GO:0031419  5.2e-13  cobalamin binding
KEGG pathway 
InterPro domain  [153-294]  IPR008930  6.8e-17  Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid
                 [42-257]   IPR002157  5.2e-13  Cobalamin (vitamin B12)-binding transporter, eukaryotic
Orthology group  MCL16010  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201788-TA
ATGTCATTAGAAACTGAACCAACGACCACTGTGGCCACTAAGGCTGTGACCACAGTGGCTCCCACTACCGCTACATGGGAGTCGGAGACTTTGAACGAGGGCCAGGCGATCCAGAAGGCTCTCCAATATCTCCTGAATCATCGCGAGTCCGACTGGGGCTGGGGCAATGACACCCACCACGTTATGTTGACTTTGCAGTTAGCAAATAATACCGGGAAGGAAATAGAGCAACAAGGATTGGAGATGCAGCTGTCGGCCAAACAAATGGAGATTGAAATACTGCTCATGATGTCCAAGCACCATGAGTCCCCTCCTCCGCTAGGGCGCCTCGCTGCCTACACGCTGGCCTTGGGAGCGCTATGCAAGGATCCTCGTTCTTTCCACGGAAGGGACCTGGTAGCCGCTCTCCTACATCGAGAACCACCCCACGACCTCGAATTCGCTTACGCCACTTTAGCAGCCTGTTCCTCAGCGGCTCATGTAAGACGCCGCCACATCAGGAGACTCCTAGATATCGCCAACGCTGCAGCCGATCATAGCCTCGATACCATATCGATGGTGATCCTGGCTTTGCGCTGCGTGGTTCAAGATCATCGTCACCGTAGCCTGATACATTTCGTGCGCCGTCCTATGGCCGGCCTGGCTCGTCAACAACACCCCGACGGTAGCTTCGGGTCGCTGACCACCACCGCGTTGGCCATACAAGCTCTGGAAGATTCCGACACAGGTCCCGGTGCCCATCAACATTGGAGTCTACCGGCTGCCCGCAAGTGGCTTCTGGAGTGTCAGTCTTCGGACGGAGGTTGGGGCGACGCTTCCAGTACAGCAGCCGCTGTAGCCGCTCTCACGCCAGCATCTCTGGCCGCTGTGCGACCACCGCATTGCAGCGACAAACTCCTAGACAGCCGCCATGAACCACTCGACAACAACGGCGGTGACGGAACTCTGAAGTTGGCATACCAATCCCACTCTACAAACGACTCCGATGCGCGAAATGTTTCCTTCACCTACACTCTATGGCTCGGCACCAATGTCACTGAGAACTACACCCTGTATATGGTAGCTCCGAGGAACATTTCATTCTACCACGTGATGCAAATGGCCGCTGAGCAGGAACCGAAATTCAAATTCGAAGCCAGCGAATGGCCAAACGGTCACTACGTTCACACCCTCGCCGGTCACAAGGAAGAACCCATGGGATATCACTACTGGCTGCTCTACCGTCTGCCAGAAATCCCTGACCCCGCCAGCCCACCCGGCAACCAACTCGTCGCTCCTGTCGGTGTGGACGACTTGATGGTGGAAGACGGAGAACATTATTTGTTCTGGTACAAGAAGCTGTAA

Protein sequence:

>DPOGS201788-PA
MSLETEPTTTVATKAVTTVAPTTATWESETLNEGQAIQKALQYLLNHRESDWGWGNDTHHVMLTLQLANNTGKEIEQQGLEMQLSAKQMEIEILLMMSKHHESPPPLGRLAAYTLALGALCKDPRSFHGRDLVAALLHREPPHDLEFAYATLAACSSAAHVRRRHIRRLLDIANAAADHSLDTISMVILALRCVVQDHRHRSLIHFVRRPMAGLARQQHPDGSFGSLTTTALAIQALEDSDTGPGAHQHWSLPAARKWLLECQSSDGGWGDASSTAAAVAALTPASLAAVRPPHCSDKLLDSRHEPLDNNGGDGTLKLAYQSHSTNDSDARNVSFTYTLWLGTNVTENYTLYMVAPRNISFYHVMQMAAEQEPKFKFEASEWPNGHYVHTLAGHKEEPMGYHYWLLYRLPEIPDPASPPGNQLVAPVGVDDLMVEDGEHYLFWYKKL-
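
Consistency check (illustrative sketch, not part of the original record): the 1344 bp transcript corresponds to 448 codons, that is 447 residues plus one stop codon, which agrees with the 447 aa protein listed above. The Python sketch below shows one way to reproduce the protein from the transcript. It assumes the standard genetic code; the function name translate and its stop_symbol parameter are hypothetical, chosen so that the output ends in "-" like the protein sequence above.

```python
from itertools import product

# Standard genetic code (assumed here), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First nine codons of DPOGS201788-TA followed by its TAA stop codon.
print(translate("ATGTCATTAGAAACTGAACCAACGACCTAA"))
```

Applied to the full DPOGS201788-TA sequence, the same call can be compared directly against DPOGS201788-PA.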