Monarch geneset OGS2.0

DPOGS200317
TranscriptDPOGS200317-TA1509 bp
ProteinDPOGS200317-PA502 aa
Genomic positionDPSCF300026 + 64882-68669
RNAseq coverage294x (Rank: top 38%)
Annotation
HeliconiusHMEL0063662e-13289.72% 
BombyxBGIBMGA005613-TA0.079.25% 
DrosophilaCG5262-PA5e-14752.19% 
EBI UniRef50UniRef50_E2AFQ62e-14855.18%Transmembrane protein 104-like protein n=9 Tax=Endopterygota RepID=E2AFQ6_CAMFO
NCBI RefSeqXP_001656073.17e-15052.36%hypothetical protein AaeL_AAEL000399 [Aedes aegypti]
NCBI nr blastpgi|3407170139e-15455.75%PREDICTED: transmembrane protein 104 homolog [Bombus terrestris]
NCBI nr blastxgi|3407170137e-15355.75%PREDICTED: transmembrane protein 104 homolog [Bombus terrestris]
Group
KEGG pathway 
InterPro domain[139-491] IPR0130573e-09Amino acid transporter, transmembrane
Orthology groupMCL13559 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200317-TA
ATGCCGCGTAGACCGTGGATAATAAGAAGAATATTTAGAATGCCTGTTGCAGAGACCAGTGATCAATATTCCGTATGGGTTGGATTGATATACATATTCAACCTCATTGTGGGGACCGGTGCTCTGACTTTGCCAGCAGCCTTTGCAAGAGCAGGATGGGGACTCAGCACAATATCCCTTGTGTTCTTGGCCTTTATGAGTTTTATGAACGCTACTTATGTGGTTGAAACCATGGCATGTGCCAACGCTGTGTTCAAATGGAAAAGATTGCAATTTATTAAGAGAAACAGTGTTCAAGAAAATGAATCCGATGAGGAAGCTACTTCACAAAACTATGGAGACATGGAGGATCCGTTGGTGTGTGGGGACTCGATACCTTCTCGATACTATAGTCTCGACAATCGGATAGAATTAGGAGAAATGGCCAATTTATTCCTCAACAAAACTGGACGCACAATGTTCTACTGCACACTCTGTGTGTATTTATATGGAGACTTATCTATATATTCAGCTGCGGTATCCAAGTCACTTATGGATGTCATATGTACAACAATCCCCTCGAATATGACGAATTCGTCGGATTGGGACGTTCTACCATGCTTCGTATCATCAGGAACAGCTCAGTACACGAGATTCGAGTGCTATAGAATATCGTTACTGACATTCTTTGGAGTCATGGGACCGTTTGTTTTCTTCAACGTGCAGAAGACCAAGTACTTGCAGCTGTTCACGTCTGGGATGAGGTGGCTCGCGTTCGCCATTATGATAACGATGGCGATCCACCTGCTGGTCGTGGACGGTCCTCAGGGCAGGCCGCCGGCGTTTGACTTCACAGGGATGCCGACGTTGTTTGGTGCCTGCGTGTATTCCTTCATGTGTCACCATTCACTGCCAGCTCTCATCAGTCCCATTCGAGGAAAAAGCCGCCTGAACCTCCATCTCTCTCTCGACTACGCCCTCATAAGCATCTTCTATCTCCTTCTAGCATTCACAGGAGCATTCGCTTTTGCCAATCTAAACGACCTGTACACTCTAAACTTCGTGCCAACAGATAACGAGAACATATTCCTTGAAATCGTCGAATACTTCCTCGCCCTTTTCCCAGTTTTCACGCTCTCCACCAGTTTTCCAATCATCGCGATCACGCTCAGGAATAACCTGCAAAGCTTATTCCTGGACACCAGTAGATTAGATTCGTACAACTTTGTCCTTAGGAAGCTGGTGTTCCCGGTAGTGGCGGTGGTACCACCGTTGCTACTGACCTACTTCTTAGAAGATATAAGTATACTGATAAAGTTCACCGGCTCTTACGCCGGCACCGGGATACAATATTTGATGCCGACGTTCCTAGTGCTCTCAGCCAGAAGGCATTGCAGCAATCTGTTAGGTCTAGGCGTCGTCAATAAATACAAAAGTCCTTTCTCGAACGTAGCGTGGGCGGCGTTCGTTCTGATGTGGTCCTTCATGTGCATTATATTAGTGTCAGTTAACATGTTCGAGAGACGATGA

Protein sequence:

>DPOGS200317-PA
MPRRPWIIRRIFRMPVAETSDQYSVWVGLIYIFNLIVGTGALTLPAAFARAGWGLSTISLVFLAFMSFMNATYVVETMACANAVFKWKRLQFIKRNSVQENESDEEATSQNYGDMEDPLVCGDSIPSRYYSLDNRIELGEMANLFLNKTGRTMFYCTLCVYLYGDLSIYSAAVSKSLMDVICTTIPSNMTNSSDWDVLPCFVSSGTAQYTRFECYRISLLTFFGVMGPFVFFNVQKTKYLQLFTSGMRWLAFAIMITMAIHLLVVDGPQGRPPAFDFTGMPTLFGACVYSFMCHHSLPALISPIRGKSRLNLHLSLDYALISIFYLLLAFTGAFAFANLNDLYTLNFVPTDNENIFLEIVEYFLALFPVFTLSTSFPIIAITLRNNLQSLFLDTSRLDSYNFVLRKLVFPVVAVVPPLLLTYFLEDISILIKFTGSYAGTGIQYLMPTFLVLSARRHCSNLLGLGVVNKYKSPFSNVAWAAFVLMWSFMCIILVSVNMFERR-