Monarch geneset OGS2.0

DPOGS213008
Transcript: DPOGS213008-TA; 1482 bp
Protein: DPOGS213008-PA; 493 aa
Genomic position: DPSCF300024 - 110668-114488
RNAseq coverage: 218x (Rank: top 45%)
Annotation
Heliconius: HMEL008200; 7e-154; 78.82%
Bombyx: BGIBMGA006918-TA; 0.0; 70.82%
Drosophila: CG6294-PA; 1e-32; 31.88%
EBI UniRef50: UniRef50_E1ZW74; 3e-82; 48.99%; Putative zinc metalloproteinase YIL108W n=2 Tax=Formicidae RepID=E1ZW74_CAMFO
NCBI RefSeq: XP_001599746.1; 5e-83; 50.76%; PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|350426733; 1e-82; 51.04%; PREDICTED: putative zinc metalloproteinase YIL108W-like [Bombus impatiens]
NCBI nr blastx: gi|350426733; 5e-89; 51.04%; PREDICTED: putative zinc metalloproteinase YIL108W-like [Bombus impatiens]
Group
Gene Ontology
GO:0017089; 1.6e-34; glycolipid transporter activity
GO:0051861; 1.6e-34; glycolipid binding
GO:0046836; 1.6e-34; glycolipid transport
GO:0005737; 1.6e-34; cytoplasm
KEGG pathway 
InterPro domain
[19-332] IPR021917; 2.3e-82; Uncharacterised protein family, zinc metallopeptidase-like
[332-488] IPR014830; 1.6e-34; Glycolipid transfer protein domain
Orthology group: MCL15840; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213008-TA
ATGGTAATGGACAACCGAATAGATGAAGAAAATAAAAATGAACAAACCTCAGCAATTTTTATTACAAATTTTCAAAATGGGGAAACAATAAATTATTCTCTGGTTCTAATAAAAGGTTTAATAACAGTTGGACCATGTAACAATAACAAAATAAGATGTACAGTTGACAGCAACGGAAACAAAAACAGTTCAGATTGGGATGTTTGCAATAGAGAATTTAAAACGATAGTTTCTCTTAAGCTTGGTGAAAATAGTATTGAATTTGAATATATTGACCAAATAAAAGTAATAAAATTGTCATATGAACCCAGAAGAACTAATCTCAGAGTATGTCCCGTATATATCATATGTCAGGGACACGATGGGTGTTTTCAGAGTCCCCCCGATGTTGATAACAGTATTGAGAGCGCTTGTAAACGTATAGCTATTGGTGCCAAAATAATTCAAAGTCTAACCGCCGAAAAGCTATTCGAAAGTGGAGTAGGAAGAAAAACATTTCAACTTGAACATGAGGTTAATCAAAAAAGGGAAAGCTGTATTATATTTAAAAGCAACCTTAATGTAAACAAAGCCAGAAAAATGAGGCAAGGAGAACTATGGACCCATTTTGGCAGGGAACTAATGCTTTCAGATTTAGGAAGTAATGACAGAAAATTTTTAGGCTTTATTTCATGTACAAGATTCAAAGGAACGGATGTTGATAAGCCAATGACACATGAAGAAATTGTATTTCTCACAGAAGCTTATGCAGCATTGGGCGGTGGTGGACTGGCTTTATTTGGAACGGCTTGTATGTATACTTGGCCCAGTTCAGTAGAAGAAATCATTCCTAGATTCCTGGATCCCACGCCAGTTAATTCAAAACGGTTTATGGATGACAGTGGCTATAGGGGAACTTTGGGAGCATGTTTTGCAACAACTCTAGGTTCTGTTTTTCATGAATTAGGCCATACGTTTGATCTTGGTCATACAAAAGACGGTATAATGGGAAGAGACCGTCTTGGAAAAGTTTTTGCACCTGTAAAATATGATATGCAAGGAAATGTTGATAAAATAAAAAATCATTATGAATACAATGAGGATACTTGTTTGTTAGAATTAATGTTAGATGAATATTCTAAAGGAAAGAACACAGCGGCTGAAGGAGTCCTATGGTTAAATAGGGCACTGCTGTTTTTTGAATTGCTGTTCCAAGAGATGTTAGTAAGTCTTCAGGCAAAAGATTATGAAGTGAGCATGAAAAAAATATTTACAGTAGCTTATGAAGGTTCTGTAAAAAAGTATCATAGTTGGATCACACAACAACTTTTTAATTTTATGTGTAAAATGTCACCTACATTTATACAAATCCTGAAATCATTTGAGGTTGAGAATGATCTTAAAGGCTTTGAAATGCAACTAGCAAATTTTAATGCAACTTTACATGGAGTAAGAAGCAAAATTGATGAATTTTTTGAAAGGAATCCTGTTTGTGATCTATAA

Protein sequence:

>DPOGS213008-PA
MVMDNRIDEENKNEQTSAIFITNFQNGETINYSLVLIKGLITVGPCNNNKIRCTVDSNGNKNSSDWDVCNREFKTIVSLKLGENSIEFEYIDQIKVIKLSYEPRRTNLRVCPVYIICQGHDGCFQSPPDVDNSIESACKRIAIGAKIIQSLTAEKLFESGVGRKTFQLEHEVNQKRESCIIFKSNLNVNKARKMRQGELWTHFGRELMLSDLGSNDRKFLGFISCTRFKGTDVDKPMTHEEIVFLTEAYAALGGGGLALFGTACMYTWPSSVEEIIPRFLDPTPVNSKRFMDDSGYRGTLGACFATTLGSVFHELGHTFDLGHTKDGIMGRDRLGKVFAPVKYDMQGNVDKIKNHYEYNEDTCLLELMLDEYSKGKNTAAEGVLWLNRALLFFELLFQEMLVSLQAKDYEVSMKKIFTVAYEGSVKKYHSWITQQLFNFMCKMSPTFIQILKSFEVENDLKGFEMQLANFNATLHGVRSKIDEFFERNPVCDL-
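
The transcript and protein entries in this record can be cross-checked against each other. The sketch below is only an illustration and not part of the geneset: it assumes the standard genetic code, and the names `CODON_TABLE`, `translate`, and `lengths_consistent` are hypothetical helpers introduced here. It translates a coding sequence in frame, writing stop codons as `-` as in the protein sequence above, and checks that a transcript length matches a protein length plus one stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard code applies to this nuclear transcript).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript holds exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 18 and last 9 bases of DPOGS213008-TA, copied from the record above.
print(translate("ATGGTAATGGACAACCGA"))
print(translate("GATCTATAA"))
print(lengths_consistent(1482, 493))
```

Passing the full DPOGS213008-TA sequence to `translate` gives a string that can be compared directly with DPOGS213008-PA, including the trailing `-`.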