Monarch geneset OGS2.0

DPOGS213876
Transcript: DPOGS213876-TA | 2487 bp
Protein: DPOGS213876-PA | 828 aa
Genomic position: DPSCF300141 - 23747-35133
RNAseq coverage: 45x (Rank: top 71%)

Annotation
Heliconius: HMEL003468 | 5e-170 | 69.67%
Bombyx: BGIBMGA013429-TA | 4e-120 | 63.07%
Drosophila: CG31272-PA | 5e-70 | 34.55%
EBI UniRef50: UniRef50_D6W8I7 | 5e-79 | 38.29% | Putative uncharacterized protein n=4 Tax=Tribolium castaneum RepID=D6W8I7_TRICA
NCBI RefSeq: XP_001994652.1 | 1e-104 | 29.03% | GH17356 [Drosophila grimshawi]
NCBI nr blastp: gi|195055494 | 3e-103 | 29.03% | GH17356 [Drosophila grimshawi]
NCBI nr blastx: gi|91076754 | 3e-78 | 37.01% | PREDICTED: similar to synaptic vesicle protein [Tribolium castaneum]

Group
Gene Ontology: GO:0055085 | 1.3e-17 | transmembrane transport
Gene Ontology: GO:0016021 | 1.3e-17 | integral to membrane
Gene Ontology: GO:0022857 | 4.2e-14 | transmembrane transporter activity
KEGG pathway: tca:660155 | 2e-41
KEGG pathway: K06258 (SV2) maps-> ECM-receptor interaction
InterPro domain: [404-820] | IPR016196 | 1.6e-41 | Major facilitator superfamily domain, general substrate transporter
InterPro domain: [59-341] | IPR011701 | 1.3e-17 | Major facilitator superfamily
InterPro domain: [405-636] | IPR005828 | 4.2e-14 | General substrate transporter
Orthology group: MCL14702 | Insect specific

Nucleotide sequence:

>DPOGS213876-TA
ATGACCGTGGCGGACGCGGAGGCCGCTTCCAGGAAGGAAGAGATCTCCCACAACCACCTGTACAAGGTCACGGGACTGTCCTCGTCCAATTTGGAGAAATTAGCTCAAGAACCAGAGGCAGACTTCGAAGAAGCAATATCAGCAACAGGCTACGGGTGGTTTAACGTGATGCTACTCCTCTGCACCCTGCCAGCCTTTTGGAGCGCGGTGTCTATCACCAGCGCCGCCTCCTACATCTTCAGCAGGGCTCAGTGCGACATGGAGCTGCGACTGCACGACCTGGGCACCGTCACCGCTATGTCTTACATCGGAATGATCAGCTCGGCCATGGTGTGGGGATACGTCTCCGACACCCTTGGGAGGAGGAGTATCCTGGTGTGGGGGTGTCTCAGCAGCGGCCTGGTGGAGGTCGTGGCTGCCATCAGCCAGAGCTTCACCATGCTTCTTGTGATGAGATTCGCTAGCGGGTTTTTATTCAACGGTCCCTTCGCAGTGCTAATCTCTTACCTCGCTGAGCTCCACCGAGCTGACATCCGAGCTCGTGTCATTCTCCTCTCCAGTTTCTTCTTCACCCTGGCCAACACCACCCTCCCGCTACTGGCGTGGGCTATTATCACCCAAGACTGGGAATTCACACTTTTCGGAGGTGGAATGGTCCTCCATTCGTGGAACATATTCCTGTTGGCCACGGCGATGGTGCCATTACTGACGGGACTGGCAGCCGTCTGCTTGCCAGAGAGCCCCAAGTTCCTCATGTCGAGAGGTCGCAACGATGAAGCTTTAGTAATATTGAAAAAAATATACTCCTGGAACACTGGCAGGCCACCCGAGACCTACCCGATAACTCGCCTAGCTCAAGAGAAACATCCACAACGTGGCCGCGGGCTGGAGGCGCTCCAGGGCGGCGTGGCCCAGCTCTCGCCGTTGTTTCGTCGACCGCATGCCGCTTGGTTGCTGCTGATATGCGTCGCACACGTGTGCTGCATGTTCGGAGCAAACACTGTTCGTCTGTGGTACCCGCAACTGGCAGCCATGATAGGCTCCGAGAGTAACGCCAGCCTCTGCTCCGCCATCGCCCCGCAGCCGCTCGCGGACGAGGTCGCAGACTGCACGCCCATAGAGACGGACATGCTCACCTACTTGCAGAACGCGGTGGTGGGAGCTGGATCTGTGCTCACTTACGGAATAGGAGGCGTGCTCATCAATCGCTCGGCCATGGTGTGGGGATACGTCTCCGACACCCTGGGGAGGAGGAGTATCCTGGTGTGGGGGTGTCTCAGCAGCGGCCTGGTGGAGGTCGTGGCTGCCATCAGCCAGAGCTTCACCATGCTTCTTGTGATGAGATTCGCTAGCGGGTTTTTATTCAACGGTCCCTTCGCAGTGCTAATCTCTTACCTCGCTGAGCTCCACCGAGCTGACATCCGAGCTCGTGTCATTCTCCTCTCCAGTTTCTTCTTCACCCTGGCCAACACCACCCTCCCGCTACTGGCGTGGGCTATTATCACCCAAGACTGGGAATTCACACTTTTCGGAGGTGGAATGGTCCTCCATTCGTGGAACATATTCCTGTTGGCCACGGCGATGGTGCCATTACTGACGGGACTGGCAGCCGTCTGCTTGCCAGAGAGTCCCAAGTTCCTCATGTCGAGAGGTCGCAACGATGAAGCTTTAGTAATATTGAAAAAAATATACTCCTGGAACACTGGCAGGCCACCCGAGACCTACCCGATAACTCGCCTAGCTCAAGAGAAACATCCACAACGTGGTCGCGGACTGGAGGCGCTCCAGGGCGGCGTGGCCCAGCTCTCGCCGTTGTTTCGTCGACCGCATGCCGCTTGGTTGCTGCTGATATGCGTCGCACACGTGTGCTGCATGTTCGGAGCCAACACTGTTCGTCTGTGGTATCCGCAACTGGCAGCCATGATAGGCTCCGAGGGTAACGCCAGCCTCTGCTCCGCCATCGCCCCGCAGCCGCCCGCGGACGAGGTCACAGACTGCAC
GCCCATAGAGACGGACATGCTCACCTACCTGCAGAACGCGGTGGTGGGAGCTGGATCTGTGCTCACTTACGGAATAGGAGGCATGCTCATCAATCGCTGTGGCAAGAAGATGGTGGCGGGTGTGTGTGGGGTGATGGGCGCTGTGTTCGTGGGCCTGCTGCCTCTCGTCGGCAGCAGTTCGTTCCCAGTGGTCGCCATAGTGACCACGGCCCTAGCCCTCACCGCCTTGTGCTCCGCCTCCCTCTCCAGCATTGCTGTAGACTTGTTCCCCACATCATTGAGGGTGATGGCGATGGCCGTGTTCCTCATGTCGGGTCGCATGGGCACCATATCTGGAACCATCGTCTTCCCCATACTCATAGACTTCGGTTGCCTACCTCCTTTCCTGACCATAGCTGCTGCTCTAATGGCGACCCCTCCTCAATGGACTATATACATGAGAGAACGTCTTCAGGAGACATCATCCAGGGGTTACATTTCATCATAA

Protein sequence:

>DPOGS213876-PA
MTVADAEAASRKEEISHNHLYKVTGLSSSNLEKLAQEPEADFEEAISATGYGWFNVMLLLCTLPAFWSAVSITSAASYIFSRAQCDMELRLHDLGTVTAMSYIGMISSAMVWGYVSDTLGRRSILVWGCLSSGLVEVVAAISQSFTMLLVMRFASGFLFNGPFAVLISYLAELHRADIRARVILLSSFFFTLANTTLPLLAWAIITQDWEFTLFGGGMVLHSWNIFLLATAMVPLLTGLAAVCLPESPKFLMSRGRNDEALVILKKIYSWNTGRPPETYPITRLAQEKHPQRGRGLEALQGGVAQLSPLFRRPHAAWLLLICVAHVCCMFGANTVRLWYPQLAAMIGSESNASLCSAIAPQPLADEVADCTPIETDMLTYLQNAVVGAGSVLTYGIGGVLINRSAMVWGYVSDTLGRRSILVWGCLSSGLVEVVAAISQSFTMLLVMRFASGFLFNGPFAVLISYLAELHRADIRARVILLSSFFFTLANTTLPLLAWAIITQDWEFTLFGGGMVLHSWNIFLLATAMVPLLTGLAAVCLPESPKFLMSRGRNDEALVILKKIYSWNTGRPPETYPITRLAQEKHPQRGRGLEALQGGVAQLSPLFRRPHAAWLLLICVAHVCCMFGANTVRLWYPQLAAMIGSEGNASLCSAIAPQPPADEVTDCTPIETDMLTYLQNAVVGAGSVLTYGIGGMLINRCGKKMVAGVCGVMGAVFVGLLPLVGSSSFPVVAIVTTALALTALCSASLSSIAVDLFPTSLRVMAMAVFLMSGRMGTISGTIVFPILIDFGCLPPFLTIAAALMATPPQWTIYMRERLQETSSRGYISS-
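
Length check: the record lists a 2487 bp transcript and an 828 aa protein. The short sketch below checks that the two figures agree. It assumes that the transcript contains only coding sequence (no UTRs) and that the trailing `-` in the protein sequence marks a stop codon; the function name `expected_cds_length` is hypothetical and not part of any database tooling.

```python
def expected_cds_length(protein_length: int, has_stop_codon: bool = True) -> int:
    """Return the coding-sequence length (bp) implied by a protein length (aa).

    Assumption: every residue is encoded by one codon of 3 bp, and the
    sequence optionally ends with a single stop codon.
    """
    codons = protein_length + (1 if has_stop_codon else 0)
    return 3 * codons


# Values copied from the record (transcript DPOGS213876-TA, protein DPOGS213876-PA).
transcript_bp = 2487
protein_aa = 828

# 828 residues + 1 stop codon = 829 codons = 2487 bp
print(expected_cds_length(protein_aa) == transcript_bp)  # prints True
```

Under these assumptions the two lengths match, which suggests that the transcript entry is a coding sequence running from the start codon through the stop codon.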