Monarch geneset OGS2.0

DPOGS202906
TranscriptDPOGS202906-TA996 bp
ProteinDPOGS202906-PA331 aa
Genomic positionDPSCF300126 + 204723-208240
RNAseq coverage543x (Rank: top 23%)
Annotation
HeliconiusHMEL0142532e-10562.54% 
BombyxBGIBMGA004179-TA2e-14977.64% 
DrosophilaCG16947-PA1e-10668.40% 
EBI UniRef50UniRef50_Q179F23e-12376.87%Vitellogenin, putative n=16 Tax=Eumetazoa RepID=Q179F2_AEDAE
NCBI RefSeqXP_001651215.16e-12476.87%vitellogenin, putative [Aedes aegypti]
NCBI nr blastpgi|1571107131e-12276.87%vitellogenin, putative [Aedes aegypti]
NCBI nr blastxgi|1571107134e-13277.01%vitellogenin, putative [Aedes aegypti]
Group
Gene OntologyGO:00082701.9e-16zinc ion binding
GO:00090556.5e-16electron carrier activity
GO:00468726.5e-16metal ion binding
KEGG pathwayame:4085909e-108 
 K10144 (RCHY1, PIRH2)maps-> Ubiquitin mediated proteolysis
    p53 signaling pathway
InterPro domain[27-101] IPR0089131.9e-16Zinc finger, CHY-type
[225-267] IPR0040396.5e-16Rubredoxin-type Fe(Cys)4 protein
[148-209] IPR0130834.5e-11Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL15768 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202906-TA
ATGTCCAAAGAAAACACTGAAACTGAAAATGGTGAATCCTCCGTAACAACTGAAGAGATAAGTGAAAAACGCATAGGTTGTGCTCATTACAAACGTAGAGCTAAATTTGTGACTCCATGCTGCAACAAACTGTACATGTGCCGCTACTGCCATGATGAGAAGGAAGAACATTATTTCAACAGGAAATCTGTAACAGAACTCATTTGTACTGAGTGTGACACAAGACAGAAGGTCCAGGCAAAATGTATTAAATGTGGAGTCACATTTGGAAAGTACACATGTCTGATCTGTAATCTGTTTGATGACGAGGACAAGAAGCAGTATCACTGTGATGGATGTGGCATCTGTAGAGTGGGCGGACGGGATAGGTTCTTTCATTGTGAACGCTGCAACATGTGTCTCCCAGTGCAGTTGCAAGAAGTCGGCCATAGGTGTGTAGAGAATGTGTCCCGTGCAAACTGTCCCGTGTGTCTTGAGGACATACACACATCCCGGATCCCGTGCCACATACCAGACTGCGGCCATCTCCTTCACAGACCGTGCTTTGAACAGATGCTTCGCTCGGGACATTACGCCTGCCCTACCTGCCAGACCAGCATGATTGATATGACCAATCTGTGGAATTATTTGGACTCAGAAGTTGCCGCTACACCGATGCCACCGGAATATGCAAACTATAAGACCACTATACTATGCAAGGATTGTCACAAGTTGTCGACTGTTAAGTTTCATGTGGTCGGTCTGAAGTGTCAACACTGCGGTGGCTACAACACGTGCCAGACAAACGGCTTTCACAAAGATTCTGGTACATCCGACCAGTGCGAGACGAGCACGCCCTCGTCCAGCCGCAGCGGTTCAAGCTCCAGCCAACCGGACACGACCGAGCAACGTGACGAGAACGACACGACAGACACGCGGGAAGAACACGACACGACCGACACACACGGACCCGACCTGCCGCGAGCCAATCCGCTGGACGAACCACCGCAAGCCTGA

Protein sequence:

>DPOGS202906-PA
MSKENTETENGESSVTTEEISEKRIGCAHYKRRAKFVTPCCNKLYMCRYCHDEKEEHYFNRKSVTELICTECDTRQKVQAKCIKCGVTFGKYTCLICNLFDDEDKKQYHCDGCGICRVGGRDRFFHCERCNMCLPVQLQEVGHRCVENVSRANCPVCLEDIHTSRIPCHIPDCGHLLHRPCFEQMLRSGHYACPTCQTSMIDMTNLWNYLDSEVAATPMPPEYANYKTTILCKDCHKLSTVKFHVVGLKCQHCGGYNTCQTNGFHKDSGTSDQCETSTPSSSRSGSSSSQPDTTEQRDENDTTDTREEHDTTDTHGPDLPRANPLDEPPQA-