Monarch geneset OGS2.0

DPOGS207786
Transcript: DPOGS207786-TA (1098 bp)
Protein: DPOGS207786-PA (365 aa)
Genomic position: DPSCF300042 + 158321-160685
RNAseq coverage: 128x (Rank: top 56%)
Annotation
Heliconius: HMEL017578  1e-128  65.40%
Bombyx: BGIBMGA005480-TA  8e-150  72.91%
Drosophila: CG14980-PB  2e-27  27.16%
EBI UniRef50: UniRef50_E2B812  3e-74  40.00%  UPF0550 protein C7orf28-like protein n=8 Tax=Endopterygota RepID=E2B812_HARSA
NCBI RefSeq: XP_967321.2  2e-74  38.48%  PREDICTED: similar to CG14980 CG14980-PB [Tribolium castaneum]
NCBI nr blastp: gi|350418297  8e-75  39.95%  PREDICTED: vacuolar fusion protein CCZ1 homolog [Bombus impatiens]
NCBI nr blastx: gi|350418297  1e-74  39.95%  PREDICTED: vacuolar fusion protein CCZ1 homolog [Bombus impatiens]
Group
KEGG pathway 
InterPro domain: [152-300]  IPR013176  2.3e-14  Protein of unknown function DUF1712, fungi
Orthology group: MCL12042  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207786-TA
ATGGTTCTGGTAGTCAGAATTCCATATGCAGCTAAAACTCCTTCTACACCAGGGGAGAGTAAGGAGGTGATTGAAACATCTGTTATATATGACTTATTAGTGTCAGCATACAAAATGTTCAGAATGTTTGTTGGTCCATTTAAAGACATACCACCTGAAGATATTTATACAAAATGTGAACAATTCTTCACACCATACATAATGTCCAGAAATATTGCAAATGACTTAAGTAATATTATACAAGGGATAAATTATTTACCGCTAGAAAAGAATTCATTCTTTAAAGTGGTGTGTTTCATAGATTTACTGGAAATTACTTACCCGGATTTTAAATGTGTATCTTTTGTCTACAATGAACAGCTTATATGGAATGGACTTGCAACAAATGATATGTTAACATTATATCAATACTTAGTACAAACTTTACTTCCCAAACAAGTTGAGAAGGAAATACAAGGTGGAGCTGTAACAGCTGCGGTAAGACATGGCCGCTTCATAAGTCCCCCAGAAGGTATTCGTACTTCGGAAGATCTTAAAAAACTACATAAGGTGTACTTAATGAGAGAAGATGATACTGAAATGAAACAATATTATTTAATAATATACAGAACCCTCAGTGCAACAGTCTGCTTCACAATTGATGTAAACACTACCCTCGACCTAGATACGTTCAAATCTCTAGATGCATTTATTGGGCCCAAACTATCTACAATAGCATCCTCTATCAGTGAGCAATGTGCGGTTCATGCTTTACAAAATGCACAACTTTCAAGCTCGGAACACAAGTTTTTGTACTTCAATCGACTGAATCTGGCATGTAAAACTTCAACCCCTCCGTCACTGACACCATCTACAGCAGTTAAACCAGAAGTTTTAAGTATTATAGCTGGTATACATGCTGACAGAAAAAGTCTCGGGAATTATGGTGAGATTATTATAAAAACCCCCGACGAGTACTGGATCACGGGAAAAAGTTCCAATGATAGAGAATTTTATGTCATAATACAAGAAAAAAATGCAAACTTAAAAGATATTGCGGATGAGGTGAAAAGAGTATGTGAAGAGCAAATGAAAGGCATTTTCTTCTATCCTATTTAA

Protein sequence:

>DPOGS207786-PA
MVLVVRIPYAAKTPSTPGESKEVIETSVIYDLLVSAYKMFRMFVGPFKDIPPEDIYTKCEQFFTPYIMSRNIANDLSNIIQGINYLPLEKNSFFKVVCFIDLLEITYPDFKCVSFVYNEQLIWNGLATNDMLTLYQYLVQTLLPKQVEKEIQGGAVTAAVRHGRFISPPEGIRTSEDLKKLHKVYLMREDDTEMKQYYLIIYRTLSATVCFTIDVNTTLDLDTFKSLDAFIGPKLSTIASSISEQCAVHALQNAQLSSSEHKFLYFNRLNLACKTSTPPSLTPSTAVKPEVLSIIAGIHADRKSLGNYGEIIIKTPDEYWITGKSSNDREFYVIIQEKNANLKDIADEVKRVCEEQMKGIFFYPI-
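The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and assuming that the trailing "-" in the protein record marks the stop codon. The function name `translate` and its `stop_symbol` parameter are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the order T, C, A, G. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is used by default
    because the protein record above appears to end that way (assumption).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last three codons of DPOGS207786-TA.
print(translate("ATGGTTCTGGTAGTCAGAATT"))  # MVLVVRI
print(translate("CCTATTTAA"))              # PI-

# Length check: 1098 bp = 366 codons = 365 residues plus one stop codon.
print(1098 // 3 - 1)                       # 365
```

Passing the full DPOGS207786-TA sequence to `translate` and comparing the result with the DPOGS207786-PA record would confirm whether the two entries agree over their whole length.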