Monarch geneset OGS2.0

DPOGS210587
TranscriptDPOGS210587-TA1299 bp
ProteinDPOGS210587-PA432 aa
Genomic positionDPSCF300168 - 457100-464472
RNAseq coverage53x (Rank: top 70%)
Annotation
HeliconiusHMEL0174131e-6659.26% 
BombyxBGIBMGA013576-TA5e-10954.16% 
DrosophilaCG31272-PA9e-3527.03% 
EBI UniRef50UniRef50_D6WB981e-4528.50%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WB98_TRICA
NCBI RefSeqXP_972569.12e-4628.50%PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
NCBI nr blastpgi|910778684e-4528.50%PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
NCBI nr blastxgi|910778687e-4828.50%PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
Group
Gene OntologyGO:00550857.7e-22transmembrane transport
GO:00160217.7e-22integral to membrane
KEGG pathwaytad:TRIADDRAFT_365215e-37 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[1-399] IPR0161962e-33Major facilitator superfamily domain, general substrate transporter
[32-402] IPR0117017.7e-22Major facilitator superfamily
Orthology groupMCL26092 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210587-TA
ATGGTGGTGAAAACGAATACTAAAACCTTCGAGGAGGCTTTATACATGACTGGTTTCGGCAGGTTCAACTGCTTGATGATGTTGGTAAATATTAGCGTGATCTTGGCCATGGCATTTGAAGTTGTCTCCGTGGCGTACTTGGTGCCAGCCAGCGCTTGTGAACTGAAAACAACGAACGCTCAACAGGGACTGATGGCGGGAATACCGCTGATGGGAATAATCGCGACGTCCCATTTCTGGGGATACCTGGCTGATACACGAGGTAGAAGGAAGATACTCATTGTATGCATGTCATTGGGTTTCTTGGCCGGGTCTTTGTCAGCATTATCACCCAACTGGATCATGTTTAGCGTTTTAAAGTTTTTGTCTTCTTGCGCAGTATCCGGGACATACGCTTTAGCCTTAACATTACTTAGCGAGTGTACTCCACATCACAAAAGATCCATAATGGTAGCACTCACCAGCACCATTTATCTCACTTCCACGGGAATAATGGCAGTGCTAACAATACCCGTGCTGCACCTGGACATGGCTTACCCGATACCATATCTTAATATAGATTTTATCCCGTGGCGTCTCCTGACTTTGGTTTTCGCTTTCCCATGTGCGTTCGCCGCCTTAGCTTTATACTTCGCGTTCGAGAGTCCCAGGTTCTTACTTCGGATCGGAGAGGAGGAAAAGGCTTTGAATATTATAAAAAGCATATTTTCAATTAATAGTGGAAAGAGCGGTGATGATTTCAATGTGGACTCATTAATTTTAGGTGAGGACGCTGGCAAGTTTGTAAAAGGTATGTGGGCATCGCTAGCTGCTCAAACAATTCCGCTGTTGAAACCTCCCCTTTTGAAAAATACACTGTTGTTGGGAGTTTTGTTCACTATAATTTATTTTGCTTTGAATTCTGTGTTAGTTTGGCTGCCTTTTATAGCGGATGCCATGATGAAATCTATTGAAAAGGGAGATAACAATTTGACCATCTGTGACATGATTCGTCGAGTACAGAATACACCGATAGAAGACGTCAATCAGGACTGCGCCCTCAACGAGTTCGGCATGATCTCAGTGTTTATAATAAGTGTTATGATAGCGACTTTCAATGTTGTTCTCAGCACACTTATCAACAAAATGGGCAGAAAACGTCTCCTGGTTTGTGTACAGAGCCTCTCTCTGTGCCTCGGTTCGGGCCTGCTCTATGTCAACCAGAGCAAGCTCATCACCCCGCTGTCTCGCCTCGACTTTGTGATGATACACCTCCGCAAGCACAAGTGCATCGATTTCCAAAGGAGGCGTGGCCGCTAG

Protein sequence:

>DPOGS210587-PA
MVVKTNTKTFEEALYMTGFGRFNCLMMLVNISVILAMAFEVVSVAYLVPASACELKTTNAQQGLMAGIPLMGIIATSHFWGYLADTRGRRKILIVCMSLGFLAGSLSALSPNWIMFSVLKFLSSCAVSGTYALALTLLSECTPHHKRSIMVALTSTIYLTSTGIMAVLTIPVLHLDMAYPIPYLNIDFIPWRLLTLVFAFPCAFAALALYFAFESPRFLLRIGEEEKALNIIKSIFSINSGKSGDDFNVDSLILGEDAGKFVKGMWASLAAQTIPLLKPPLLKNTLLLGVLFTIIYFALNSVLVWLPFIADAMMKSIEKGDNNLTICDMIRRVQNTPIEDVNQDCALNEFGMISVFIISVMIATFNVVLSTLINKMGRKRLLVCVQSLSLCLGSGLLYVNQSKLITPLSRLDFVMIHLRKHKCIDFQRRRGR-