Monarch geneset OGS2.0

DPOGS204460
Transcript | DPOGS204460-TA | 1143 bp
Protein | DPOGS204460-PA | 380 aa
Genomic position | DPSCF300002 + 403682-406272
RNAseq coverage | 156x (Rank: top 52%)
Annotation
Heliconius | HMEL006246 | 0.0 | 83.20%
Bombyx | BGIBMGA007803-TA | 1e-163 | 78.85%
Drosophila | CG8111-PA | 4e-65 | 36.99%
EBI UniRef50 | UniRef50_D6WKX6 | 1e-66 | 36.51% | Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WKX6_TRICA
NCBI RefSeq | XP_624625.1 | 1e-69 | 37.43% | PREDICTED: similar to CG8111-PA [Apis mellifera]
NCBI nr blastp | gi|340719978 | 3e-69 | 38.52% | PREDICTED: transmembrane protein 43 homolog [Bombus terrestris]
NCBI nr blastx | gi|340719978 | 1e-68 | 38.59% | PREDICTED: transmembrane protein 43 homolog [Bombus terrestris]
Group
KEGG pathway 
InterPro domain | [65-293] | IPR012430 | 7.7e-44 | Protein of unknown function DUF1625, TMEM43
Orthology group | MCL13762 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204460-TA
ATGGACTGGTCTGCAGTCTCTAGTCACTTCAAACAGACATGGCTTACTACTATAATGAGCATGATTTTATTTACTGGCGTAACATATTTTTTGATGTGGTCAGAGTGCCAAACTATTCAAAGCAACTTAATGTTAGAAGATCTAATATCAGCTGCAGAAAGCATTGATGTATTTACAAAAGATGAAGCCGAACAATATGAGTCTAAAGTGGTCCATATTACAGGACCTCTTAGGATATTAGAGCCCATCTCAGAACCTGACTATAATATACATGTACAAGCTGTGAAGTTACGAAAGAGAGTGCAAATGTACCAATGGATAGAGGAAACAACGGAAACAGAGAATTTTCTAAGTGAGCCGGCAGAGGAATCTCAAAAGACATACTGGTATCATAAGGACTGGAGAGACTATGTTGTTGATTCATCATTGTTCTATATAAGACCTGGACACCACAACCCCACATCAATGCCAATGTTCAGTGAAACTCATATTGCTGATGATGTGAAAATAGGATGGATGTTTTTAGGTATGGATGTCAAACGTAAAGTAAATGATTATTATGAGATCTGGTCTGACACCCGTCCAGACAGGAGTGACATCAAACTGCACTCCGGCTTTTATTACCATGGCAACAGCGCATTGGAACATGAAATAGGTGATCTTCGTATTCATTTCTCTTATGCCGGCCGAGAGGATGATATTTATACAGCAGTGGGTTTGGTTGAGAGAGCTACTCTTCAACCATATAGCGCTGAACGTTTTCCTACAGCTGACCCCATATCATTACTAAGGAAGGGATCATACAGCTTAAAACAGTTGCACGATTTAGAGAAACGTGATGCAAACACACACACATGGAAATACAGGCTGTTGGGGTTTGTGCAAGTATTTGCATCGGCTATGACTCTGCATCCTGAATGGCTGACATTATTTCTTCAATGCCAATGGATATCTAGTAACTTAAGAAGATGTACAAGATTGTGGGTTAATTTAGTGCTTTCTTTCTCATATACACTGTTTGTGGTCGCTATGCCTTGGTTGCTCCACAAGCCTGCTCTAGGGTTGATGATACTTATGGGCAGTTTTTGTCCGTTGCTACACTACTCCACGCTACTGGCCCGTGAACCACGACATTCGCTTTGA

Protein sequence:

>DPOGS204460-PA
MDWSAVSSHFKQTWLTTIMSMILFTGVTYFLMWSECQTIQSNLMLEDLISAAESIDVFTKDEAEQYESKVVHITGPLRILEPISEPDYNIHVQAVKLRKRVQMYQWIEETTETENFLSEPAEESQKTYWYHKDWRDYVVDSSLFYIRPGHHNPTSMPMFSETHIADDVKIGWMFLGMDVKRKVNDYYEIWSDTRPDRSDIKLHSGFYYHGNSALEHEIGDLRIHFSYAGREDDIYTAVGLVERATLQPYSAERFPTADPISLLRKGSYSLKQLHDLEKRDANTHTWKYRLLGFVQVFASAMTLHPEWLTLFLQCQWISSNLRRCTRLWVNLVLSFSYTLFVVAMPWLLHKPALGLMILMGSFCPLLHYSTLLAREPRHSL-
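
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein lengths listed for DPOGS204460 can be cross-checked with a few lines of Python. The sketch below assumes that the transcript shown is coding sequence only (it begins with ATG and ends with TGA), that the standard genetic code applies, and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `expected_cds_length` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are rendered with stop_symbol; "-" is an assumption
    chosen to match how the record above ends its protein sequence.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed for protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # 380 aa plus a stop codon should account for the 1143 bp transcript.
    print(expected_cds_length(380))         # 1143
    # First six and last four codons copied from DPOGS204460-TA above.
    print(translate("ATGGACTGGTCTGCAGTC"))  # MDWSAV
    print(translate("CATTCGCTTTGA"))        # HSL-
```

Passing the full DPOGS204460-TA sequence to `translate` and comparing the result with DPOGS204460-PA would extend the same check to the whole record.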