Monarch geneset OGS2.0

DPOGS206927
TranscriptDPOGS206927-TA933 bp
ProteinDPOGS206927-PA310 aa
Genomic positionDPSCF300001 - 1124422-1127139
RNAseq coverage598x (Rank: top 21%)
Annotation
HeliconiusHMEL0154631e-16990.00% 
BombyxBGIBMGA012903-TA3e-13485.82% 
DrosophilaUfd1-like-PA2e-10157.94% 
EBI UniRef50UniRef50_Q9VTF93e-9957.94%Ubiquitin fusion degradation protein 1 homolog n=16 Tax=Pancrustacea RepID=UFD1_DROME
NCBI RefSeqXP_001605370.17e-12169.41%PREDICTED: similar to ubiquitin fusion degradaton protein [Nasonia vitripennis]
NCBI nr blastpgi|3454799983e-11969.41%PREDICTED: ubiquitin fusion degradation protein 1 homolog [Nasonia vitripennis]
NCBI nr blastxgi|3072122871e-11871.10%Ubiquitin fusion degradation protein 1-like protein [Harpegnathos saltator]
Group
Gene OntologyGO:00065117.9e-160ubiquitin-dependent protein catabolic process
KEGG pathwaynvi:1001217642e-120 
 K14016 (UFD1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[1-310] IPR0048547.9e-160Ubiquitin fusion degradation protein UFD1
Orthology groupMCL14153 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206927-TA
ATGTTTCAGTTGGGGTTCAATATGTTTCATGAAATTGCAAGAAATTTTAACATGACCTATCGTTGCTACTCAGTTTCTATGCTACCTGGAAACGAAAGGCAAGATGTTGAAAGGGGAGGAAAAATTATTATGCCACCATCAGCTTTGGAGCTGCTAACTCGTCTTAACATCGAATACCCAATGATATTCAAGTTAACAAATAAGAAGACAAAAAGAATTACACATTGTGGTGTTTTGGAGTTTGTTGCTGATGAGGGAAAGGTTTATTTGCCACACTGGATGATGGCTAATTTAGTACTAGAAGAAGGTACCCTGGTGCAAATAGAGAGTGTCTCCCTACCAGTTGCTACATTCTCGAAGTTCCAACCACTTTCAGAAGACTTTCTCGACATTTCCAATCAAAAAGCAGTGTTGGAAAACTGTTTACGTAACTTCTCATGCCTGACTACAGGAGATGTCATAGCAATAAAGTATAATTCTAAAGTGTATGAACTCTGTGTGTTGGAGACGAAACCCGGAAACGCTGTGATTATTATTGAATGTGATATGAATGTTGAGTTTGCTCCACCGGTTGGGTATAAAGAAGAAGATCATATAACAAAAGGTGAAGGTAGCTCGGATATGGGTCGCATGGATGAAGACCCAGCTTCTATGATGCCGGAGCCGAGTGGTTTTGTGGCATTCAGAGGTGAAGGGAACAGGCTTGATGGTAAAAAGAAAAAGCTAACCAGCGAAAGTGAATCCGAACCACAGGCATCCAATTCCAGACAGCCATATGTTCGTGGAATACCGGATTACGACTATGTTATTGGTACACTCAGATTTATCAGAAATTCAAGGCCCCCGAGCGCGAAAGAAGAAGTACAAACAGAACCATTCCAAGCCTTTAAAGGGGAGGGTTTCACATTGCGTACTGCCAAGTCGAAGAACTGA

Protein sequence:

>DPOGS206927-PA
MFQLGFNMFHEIARNFNMTYRCYSVSMLPGNERQDVERGGKIIMPPSALELLTRLNIEYPMIFKLTNKKTKRITHCGVLEFVADEGKVYLPHWMMANLVLEEGTLVQIESVSLPVATFSKFQPLSEDFLDISNQKAVLENCLRNFSCLTTGDVIAIKYNSKVYELCVLETKPGNAVIIIECDMNVEFAPPVGYKEEDHITKGEGSSDMGRMDEDPASMMPEPSGFVAFRGEGNRLDGKKKKLTSESESEPQASNSRQPYVRGIPDYDYVIGTLRFIRNSRPPSAKEEVQTEPFQAFKGEGFTLRTAKSKN-