Monarch geneset OGS2.0

DPOGS200897
TranscriptDPOGS200897-TA1578 bp
ProteinDPOGS200897-PA525 aa
Genomic positionDPSCF300066 - 115521-118572
RNAseq coverage657x (Rank: top 19%)
Annotation
HeliconiusHMEL0098216e-12146.09% 
BombyxBGIBMGA000548-TA4e-6482.09% 
DrosophilaCG10338-PA5e-8439.02% 
EBI UniRef50UniRef50_E2BJZ54e-8738.85%UPF0420 protein C16orf58 n=1 Tax=Harpegnathos saltator RepID=E2BJZ5_HARSA
NCBI RefSeqXP_968556.26e-8940.05%PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium castaneum]
NCBI nr blastpgi|3407112437e-9139.41%PREDICTED: UPF0420 protein C16orf58 homolog [Bombus terrestris]
NCBI nr blastxgi|3287787724e-8939.78%PREDICTED: UPF0420 protein C16orf58 homolog [Apis mellifera]
Group
KEGG pathway 
InterPro domain[46-339] IPR0069687.5e-114Protein of unknown function DUF647
Orthology groupMCL14300 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200897-TA
ATGTCTTCAAACGAGGGAGAGATATTGCTACAAGAAAAATATGGAACATCTGCTAAGGAGAGGTATTATGTTAAAGCCGCAGATCAATTGCCAATAGTATTAGTAGTCAATGAGAAGTCTCGCGACGTCGCTGGATTATTCGCTAAAATATTTCTCCCCCAAGGATATCCTAACAGTGTCAGCAAAGATTACATTTTTTACCAAATTTGGGACACTGCTCAAGCATTTTGCAGCACTATTACAGGTATACTGGCCACACAGGAAGTTTTCCGTGGGGTGGGAGTGGGAGATACAAATGCTTCACCATTAGCAGCCACTGTTACTTGGGTGTTCAAAGATGGCTGTGGGCATATTGGGAAAATATTATTCGCTTATACCCATGGAACATATTTAGATGCCTATAGCAAAAAATGGCGTCTGTATGCAGATACATTAAATGATGCTGCCATGTGTATTGAAATAGCACTACCATTGTTCAAGAATTATATTACATTTGCTCTCTGCGTCAGCACTTGTATGAAGGCTATTGTCGGAGTTGCGGGGGGTGCTACCAGAGTGGCAATGACCCAACACCATGCTCTCCGTGGTAATCTCGCTGATGTATCAGCTAAAGACTCCGCCCAAGAGACTGCTGTTAATCTTATAGCTTCTTTCGCTGCACTGTTCCTAATATCTTTGATAGGGAATTCGGTGACGATATTTATAATATTATTAATTATGCATATTGTATTCAACTACATGGCAGTTCGGGCAGTTTGTTTACGAACACTGAATGAACCCCGTTTCTTACAAGTAATTGACACATACTTGCGGAAGGAGGTAATTGCCAACCCATGTGAAATAAATCGTAACGAACCCATTATTTTCTATCAACTGGGACCCAATTTGTTAGATTTAAAAATATGCGGTTTTCATATCATAATTGGCGACTCGATATCGAAGATTTTAAACCCAAGAACTAATGCAGTGTATATAAACAAAGTAAAAGATATTTATAACGATAAGAAATACATAATTCATCCTGATACCGGAAACAGAGTGATGTACGTTTTTCCAAAGGAAGATGCGTCGGTAGACGACATGCTATGCGCTTACTTTCAGTCTGTTTTGCTTGCGATTATTACTTGTGCTATTAACGACCACCAATTGGCTATATTCAGCTCCAATAATAACACGAAGCCATTCGCTCAAGTGTGTGTGACACTACAATCAGCTGAGTGGAGCCGGGCTACCGGTTCCGGGGGAGACTTTCAATATGAACCGTCTTATGATCTGCATCGTTATGTTAAGAATATAGCTAGCGATGAATGGACAGCCATCAGAGAAGGTCTTTTGCAGACGGGTTGGGATCTAAGCAAGCATTTATTGATAGTAGATGAATGGCGATTATGTAGTGAAAATGTCACTCCTGTAGCTATACTACCTGAAGAAGTGAAGTACAATCGCCCGATCGCTATACCAGAAACTCGCAAAGAATCTTTCACGATAGAACCGGACACATCGGATAGCACACTCAGCAATATACCAGAAGCCACAAAATCGAAAACCGATTTAAACTATCGTTTAAAAAAGGAATGA

Protein sequence:

>DPOGS200897-PA
MSSNEGEILLQEKYGTSAKERYYVKAADQLPIVLVVNEKSRDVAGLFAKIFLPQGYPNSVSKDYIFYQIWDTAQAFCSTITGILATQEVFRGVGVGDTNASPLAATVTWVFKDGCGHIGKILFAYTHGTYLDAYSKKWRLYADTLNDAAMCIEIALPLFKNYITFALCVSTCMKAIVGVAGGATRVAMTQHHALRGNLADVSAKDSAQETAVNLIASFAALFLISLIGNSVTIFIILLIMHIVFNYMAVRAVCLRTLNEPRFLQVIDTYLRKEVIANPCEINRNEPIIFYQLGPNLLDLKICGFHIIIGDSISKILNPRTNAVYINKVKDIYNDKKYIIHPDTGNRVMYVFPKEDASVDDMLCAYFQSVLLAIITCAINDHQLAIFSSNNNTKPFAQVCVTLQSAEWSRATGSGGDFQYEPSYDLHRYVKNIASDEWTAIREGLLQTGWDLSKHLLIVDEWRLCSENVTPVAILPEEVKYNRPIAIPETRKESFTIEPDTSDSTLSNIPEATKSKTDLNYRLKKE-