Monarch geneset OGS2.0

DPOGS201886
TranscriptDPOGS201886-TA1395 bp
ProteinDPOGS201886-PA464 aa
Genomic positionDPSCF300191 + 313371-322456
RNAseq coverage1811x (Rank: top 7%)
Annotation
HeliconiusHMEL0131017e-2237.93% 
BombyxBGIBMGA006063-TA5e-3749.14% 
DrosophilaLamp1-PA6e-1931.39% 
EBI UniRef50UniRef50_Q7Q1R06e-2033.33%AGAP009668-PA n=3 Tax=Culicidae RepID=Q7Q1R0_ANOGA
NCBI RefSeqXP_318716.41e-2033.33%AGAP009668-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582985382e-1933.33%AGAP009668-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582985388e-3737.41%AGAP009668-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160202e-20membrane
KEGG pathwayaga:AgaP_AGAP0096683e-20 
 K06528 (LAMP1_2, CD107)maps-> Lysosome
    Phagosome
InterPro domain[218-450] IPR0020002e-20Lysosome-associated membrane glycoprotein
Orthology groupMCL15028 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201886-TA
ATGATTCGGTTAAGGTTTTATTTGGTTTGGGCAGTAGTCTGCAGTTCAATGGTTTTCTGTCAAGGCTCTCTGACACCGCCAAAACCAGTTGTAATACCCGTGCCGGTCCCGTCCCTTGCGGCAAGTCCTGCGGTTAGCAGTGCTGCGTCTGAAGAACCAAAGATTTCGACGGAAATAATTTCCACGACAACAAAACCGTCGTCTGAACCAGAACCTACTACTACCACACCTACCACTCCTCCGACGACTACTTCGACTACTCCTTCAACAACAACTCCAACAACACCTCCGACGACTCCTCCAACCACAACCCCAACCCCTACAACTACAACCCCGACACCTCCCCCAACGCCCCCTCCGGCCCCGACACCAGCTCCTGGCCCTCTTCCACCACCACAACAAGGAACTTGGTCCTGGACTGACAAAAACAATGTGACCTGCATCCTCATTAAGTTCGCTGCTCAGCTGAATGTCACCTACCCAATCGATAACTCATGTCAAGGCTCTCTGACACCGCCAAAACCAGTTGTAATACCCGTGCCGGTCCCGTCCCTTGCGGCAAGTCCTGCGGTTAGCAGTGCTGCGTCTGAAGAACCAAAGATTTCGACGGAAATAATTTCCACGACAACAAAACCGTCGTCTGAACCAGAACCTACTACTACCACACCTACCACTCCTCCGACGACTACTTCGACTACTCCTTCAACAACAACTCCAACAACACCTCCGACGACTCCTCCAACCACAACCCCAACCCCTACAACTACAACCCCGACACCTCCCCCAACGCCCCCTCCGGCCCCGACACCAGCTCCTGGCCCTCTTCCACCACCACAACAAGGAACTTGGTCCTGGACTGACAAAAACAATGTGACCTGCATCCTCATTAAGTTCGCTGCTCAGCTGAATGTCACCTACCCAATCGATAACTCATCGTCTCCGTTGGGCCACCTGGTGATGAACGTCCCCTCCACCGCCGTGGTGGTGAACGGCAGCTGTGACGGGTCTCAGCAGTGGGTGAACATCACGTGGCCGGTGTGGGACAAGCCCGCCTCGCCCATGAACAGCATGCTGATGGTGTTCGCCAACAACGCCACCACCAAGCTCTACTCCCTGGAACATCTCAACCTCTTGCTCATGCCGGAAGTCTTCCCTAACGCATCATCCGAGATGTTCGTCCACTGGCAGGGCTCCACGTGGCGAACTCCCCAGGCGACCTCCTACCGCTGCTCCCCTCCCACTCAGCTCAACCTCACAGCCGATGCCGTGAACCAGGTCGCCACCCTCACCCTGACGCAGCTCCAGGAGGAAGCCTTCCGGACATCCGGAAACAGCAGCCAATTCAGTGCTCTCGTAGCCGTTTTAATATTTGTTGTCTGTGAACTTATCGATTAG

Protein sequence:

>DPOGS201886-PA
MIRLRFYLVWAVVCSSMVFCQGSLTPPKPVVIPVPVPSLAASPAVSSAASEEPKISTEIISTTTKPSSEPEPTTTTPTTPPTTTSTTPSTTTPTTPPTTPPTTTPTPTTTTPTPPPTPPPAPTPAPGPLPPPQQGTWSWTDKNNVTCILIKFAAQLNVTYPIDNSCQGSLTPPKPVVIPVPVPSLAASPAVSSAASEEPKISTEIISTTTKPSSEPEPTTTTPTTPPTTTSTTPSTTTPTTPPTTPPTTTPTPTTTTPTPPPTPPPAPTPAPGPLPPPQQGTWSWTDKNNVTCILIKFAAQLNVTYPIDNSSSPLGHLVMNVPSTAVVVNGSCDGSQQWVNITWPVWDKPASPMNSMLMVFANNATTKLYSLEHLNLLLMPEVFPNASSEMFVHWQGSTWRTPQATSYRCSPPTQLNLTADAVNQVATLTLTQLQEEAFRTSGNSSQFSALVAVLIFVVCELID-