Monarch geneset OGS2.0

DPOGS206467
Transcript         DPOGS206467-TA   1209 bp
Protein            DPOGS206467-PA   402 aa
Genomic position   DPSCF300070 + 168282-169490
RNAseq coverage    82x (Rank: top 64%)
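The lengths and coordinates above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the genomic coordinates are 1-based and inclusive and that the transcript length includes a terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of cds_bp bases.

    Assumes the whole sequence is coding; the stop codon (if present)
    is not counted as a residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: 168282-169490, 1209 bp.
print(span_length(168282, 169490))      # 1209
print(expected_protein_length(1209))    # 402
```

Under these assumptions the genomic span equals the transcript length (1209 bp), and the 1209 bp transcript encodes 402 residues plus a stop codon, which agrees with the listed protein length.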
Annotation
Heliconius       HMEL012946         0.0      92.02%
Bombyx           BGIBMGA005464-TA   0.0      80.55%
Drosophila       CG4953-PA          4e-97    46.40%
EBI UniRef50     UniRef50_A5PLN9    2e-118   52.61%   UPF0533 protein C5orf44 n=65 Tax=Coelomata RepID=CE044_HUMAN
NCBI RefSeq      XP_001599918.1     1e-136   58.56%   PREDICTED: similar to LOC549181 protein [Nasonia vitripennis]
NCBI nr blastp   gi|340709998       1e-136   59.01%   PREDICTED: UPF0533 protein C5orf44 homolog [Bombus terrestris]
NCBI nr blastx   gi|340709998       3e-133   58.81%   PREDICTED: UPF0533 protein C5orf44 homolog [Bombus terrestris]
Group
KEGG pathway 
InterPro domain    [3-401] IPR010378   1.3e-171   Protein of unknown function DUF974
Orthology group    MCL12797   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206467-TA
ATGGAAAATAAAGAATCTGTCGAACATTTGGTGGCCCTTAAAGTTATGAGACTTACGAAACCAGCACTAATAAGCCCTAAGATAGTTACTTGTGATTTTAAAGATTTGCCTGGTAATATTTTAAACAACTTTTTAAAAGATGATGCGACTTCGGTGGTTCAAATGGAAACACTAGCTGCTGGACAGTTTTTGTTATTGCCACAAAGTTTCGGTAATATATACCTGGGTGAAACATTTTCGTGCTATGTATGTGTGCACAATGAGACAAATCAACCAGTGCAAAGTGTATCTATCAAGGCTGACTTGCAGACAAGTTCTCAGAGAATTCCATTAACTACTCAACAAAACCAGTCTCCTATCATGCTAGATGTTGATGAAACACTCAGTGATGTGATACATCATGAAGTTAAGGATTTGGGTACACATATCCTTGTCTGCGAAGTTACATATATGTCAAACTATAGTACACTAGCATCATTTAGGAAGTTCTTTAAGTTTGAAGTACTAAAACCTTTGGATGTCAAAACTAAATTCTATAATGCTGAATCAGATGATGTCTTTGTAGAAGCTCAAGTACAAAATATAACATCTGGGCCAATTATACTAGAGACAGTTGCTTTGGAGAGCTCCCATCAATTCACAGTTAAGTCATTGAATGAAGATGACAATGGTGTATCAGTTTTTGGTGATGTGACTTTATTGCAACCACAGGAAAGTTGTCAGTACTCATATTGTCTTACACCAAAAGAGAACATATTGAAAGATATCAAATTATTGGCAGCAGCTAAAAATATTGGTAAATTGGATATAGTGTGGAGGTCTAACTTAGGTGAGAAGGGAAGACTGCAAACTAGTCAACTGCAGAGAATGATTCCAGATTATGGAGATATAAGAGTTACTTATGAAAATGTTCCAAGTAGAGTTCCTATTGATGAGCCTTTTAAATTCAATTGTAAAATTGTGAATGCCAGTGAGAGAACACTCGATTTGATTCTCAAACTCCGTTCACTCCAAAATTCTAGCCTTCTTTGGTGTGGTATTTCAAATAGAAAATTAGGGCCTTTGGAACCAGGCAATACCACAATTGTCAATCTTACCGTTCTACCAATAAATTCTGGTCTTCATACTGTAACCGGAGTGTCTTTAGTAGATTTATTCCTAAAAAGGACTTATGATTATGATGATTTAGCTTCTGTATATGTATGTTGA

Protein sequence:

>DPOGS206467-PA
MENKESVEHLVALKVMRLTKPALISPKIVTCDFKDLPGNILNNFLKDDATSVVQMETLAAGQFLLLPQSFGNIYLGETFSCYVCVHNETNQPVQSVSIKADLQTSSQRIPLTTQQNQSPIMLDVDETLSDVIHHEVKDLGTHILVCEVTYMSNYSTLASFRKFFKFEVLKPLDVKTKFYNAESDDVFVEAQVQNITSGPIILETVALESSHQFTVKSLNEDDNGVSVFGDVTLLQPQESCQYSYCLTPKENILKDIKLLAAAKNIGKLDIVWRSNLGEKGRLQTSQLQRMIPDYGDIRVTYENVPSRVPIDEPFKFNCKIVNASERTLDLILKLRSLQNSSLLWCGISNRKLGPLEPGNTTIVNLTVLPINSGLHTVTGVSLVDLFLKRTYDYDDLASVYVC-
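
The relationship between the two sequences above can be illustrated by translating the nucleotide record codon by codon. This is a minimal sketch, assuming the standard genetic code and that the trailing "-" in the protein record marks the stop codon; `translate` is a hypothetical helper written for this illustration, not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third position varying fastest ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 bases of DPOGS206467-TA.
print(translate("ATGGAAAATAAAGAATCTGTCGAACATTTG"))  # MENKESVEHL
# Final 9 bases of DPOGS206467-TA.
print(translate("GTATGTTGA"))                       # VC-
```

Both fragments agree with the corresponding ends of DPOGS206467-PA. Passing the full nucleotide sequence to the same function would allow the whole record to be compared in the same way.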