Monarch geneset OGS2.0

DPOGS210736
Transcript  DPOGS210736-TA  1410 bp
Protein  DPOGS210736-PA  469 aa
Genomic position  DPSCF300013 + 203827-207253
RNAseq coverage  380x (Rank: top 32%)
Annotation
Heliconius  HMEL007077  0.0  73.50%
Bombyx  BGIBMGA006267-TA  0.0  73.89%
Drosophila  CG13605-PB  2e-50  31.06%
EBI UniRef50  UniRef50_D6WEC9  1e-90  41.59%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WEC9_TRICA
NCBI RefSeq  XP_968664.1  3e-91  41.59%  PREDICTED: similar to CG13605 CG13605-PA [Tribolium castaneum]
NCBI nr blastp  gi|91079492  5e-90  41.59%  PREDICTED: similar to CG13605 CG13605-PA [Tribolium castaneum]
NCBI nr blastx  gi|91079492  1e-86  41.12%  PREDICTED: similar to CG13605 CG13605-PA [Tribolium castaneum]
Group
Gene Ontology  GO:0005515  4.7e-08  protein binding
               GO:0008270  4.7e-08  zinc ion binding
KEGG pathway  sbi:SORBI_10g011070  6e-09
              K10601 (SYVN1, HRD1) maps-> Ubiquitin mediated proteolysis
                                          Protein processing in endoplasmic reticulum
InterPro domain  [407-454]  IPR013083  4.6e-16  Zinc finger, RING/FYVE/PHD-type
                 [410-447]  IPR001841  4.7e-08  Zinc finger, RING-type
                 [410-447]  IPR018957  6.2e-08  Zinc finger, C3HC4 RING-type
Orthology group  MCL11753  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210736-TA
ATGGCTCAGCCGAGTCAAGTAACTCTTAATATAGAGGATGAGAACCCTGATTCAGTAGAAAATAGTAATCTTAATAATGCACTCCATAGAACGGACCCTCCATCCAGACCAATGGTCGCAAATAGAAGTAATTTAGGCCTCGGAGACAGACTTAATAGTGTTTTCAGGGAAATAAGACCGTTAGTAGAACATGCAAGAATGGGCAACAATTCAAGACTTTCTCTACCAACGTGGTTACCGAGGAATTCACTGGGCATCCACCAATCCGGGGAGGTGACGCAAAGACCACAGAGTTCTATAGCCCATGTAAATCTTGGCTCTACGGCTCAAACATACATTGTAACAGATAGAGGACTTCCCATGTCACCGAGACATCAGAATCACGGGATGAGCAACAGTGCTTCAAATGATTCAAATGTATCAGAAAGAGCACAGGATCAGGTAGACATAAATGTATCGGTGAATCCCTCAAACAATAATAATATTAATGACAATGCAGATAACGATAGTCAGAGTGAAGACGGAACCCAGCAGGTAGTTGATGTTAGAGCCACTTTGAACCTGTTGTTGCGTTATGCTCCGTTCTATATAATTCTATACATTAAATACATGTACGACAGTCGTGAGGGTATATTCACATTTGTGGTATTATTATGCACATTCTCACACGGAAATGGCTTGGTTAAGAGAGAAAATGGGAAGCAGATGAATAGGAGCTTACTAGCATTATTTAGTGAATTTGTATTTGCTACGAGTTCTATACTAATCGTCCACTTCCTGTGCGGTCATGGGAAGCTATTGGAAAATGTGGTGATGTTTCCTGTTTATACGGAGCCGATCACAGTTTGGGAACTGCTATGGCTTGTCATATTAACGGATCTCATTGTTAAGATTATAACGGTCAACATAAAGATCGTGATCACAATGTTGCCAGCTTTCATATTACCGTTCCAGAAGAGGGGTAAAGTGTATTTGTTCACGGAGGTGGTGTCTCAACTGTACCGCTCCATAATAACCATCCAGCCGTGGATCTTCTACCTGATGCAGTCCTACGAGGGCTCCGAGCGTATGGTGGGGATGTTCCTCACAGCGCTGTACGTCATCTCCAAGGTTGTGGAGTTACTGCTGAGGCTGAGGCTGGTGAAAAATGCCACTTGGACCTTGCTGCAGAGCGTCAGTCTGGGCACGAAGCCAACTTGCGAGCAGATGGTTGCCGCTGGTGATTCCTGTCCAATCTGCCACGACGACTACACCACACCAGTCAGGTTGACCTGCAGCCATATCTTCTGCGAGCTTTGCATCTCCGCGTGGTTGGATCGCGAGCACACTTGCCCGCTGTGCCGTGCCAAGGTCGCCGACGAACCGACTTGGAGAGACGGTTCAACCACATACGATTTCCAACTCTGTTAA

Protein sequence:

>DPOGS210736-PA
MAQPSQVTLNIEDENPDSVENSNLNNALHRTDPPSRPMVANRSNLGLGDRLNSVFREIRPLVEHARMGNNSRLSLPTWLPRNSLGIHQSGEVTQRPQSSIAHVNLGSTAQTYIVTDRGLPMSPRHQNHGMSNSASNDSNVSERAQDQVDINVSVNPSNNNNINDNADNDSQSEDGTQQVVDVRATLNLLLRYAPFYIILYIKYMYDSREGIFTFVVLLCTFSHGNGLVKRENGKQMNRSLLALFSEFVFATSSILIVHFLCGHGKLLENVVMFPVYTEPITVWELLWLVILTDLIVKIITVNIKIVITMLPAFILPFQKRGKVYLFTEVVSQLYRSIITIQPWIFYLMQSYEGSERMVGMFLTALYVISKVVELLLRLRLVKNATWTLLQSVSLGTKPTCEQMVAAGDSCPICHDDYTTPVRLTCSHIFCELCISAWLDREHTCPLCRAKVADEPTWRDGSTTYDFQLC-
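
Consistency check (illustrative sketch, not part of the original record): the listed lengths agree if the 1410 bp transcript is read as 469 codons plus one stop codon, since 3 x (469 + 1) = 1410. The Python sketch below translates the first and last codons of DPOGS210736-TA with the standard genetic code and shows how an InterPro span such as [410-447] could be cut out of a protein string. The helper names `translate` and `domain_slice` are hypothetical, the domain coordinates are assumed to be 1-based and inclusive, and the code writes the stop codon as `*` where the record above uses `-`.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return protein[start - 1:end]


# First seven and last four codons of DPOGS210736-TA, copied from the record.
print(translate("ATGGCTCAGCCGAGTCAAGTA"))  # MAQPSQV
print(translate("CAACTCTGTTAA"))           # QLC*

# Length check: 469 residues plus one stop codon.
print(3 * (469 + 1))                       # 1410
```

Passing the full DPOGS210736-PA string to `domain_slice(protein, 410, 447)` would return the 38 residues annotated as the RING-type zinc finger, provided the coordinate assumption above holds.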