Monarch geneset OGS2.0

DPOGS210777
TranscriptDPOGS210777-TA1725 bp
ProteinDPOGS210777-PA574 aa
Genomic positionDPSCF300386 - 54209-84162
RNAseq coverage1058x (Rank: top 12%)
Annotation
HeliconiusHMEL0165443e-15066.59% 
BombyxBGIBMGA004215-TA2e-7687.18% 
DrosophilaCG2991-PB4e-10538.77% 
EBI UniRef50UniRef50_Q7PVU75e-12644.73%AGAP009181-PA n=2 Tax=Pancrustacea RepID=Q7PVU7_ANOGA
NCBI RefSeqXP_001602313.14e-14345.97%PREDICTED: similar to conserved hypothetical protein, partial [Nasonia vitripennis]
NCBI nr blastpgi|1950308001e-12644.06%GH10682 [Drosophila grimshawi]
NCBI nr blastxgi|2700098882e-11441.91%hypothetical protein TcasGA2_TC009208 [Tribolium castaneum]
Group
Gene OntologyGO:00082703.2e-20zinc ion binding
KEGG pathwaytca:6572788e-114 
 K03871 (VHL)maps-> Pathways in cancer
    Ubiquitin mediated proteolysis
    Renal cell carcinoma
InterPro domain[397-448] IPR0110163.2e-20Zinc finger, RING-CH-type
[395-453] IPR0130833.8e-15Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL15770 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210777-TA
ATGCGTTCTCTGTTAGCGTATTACGCTGATGCAGTGATGTTGGCTGCGAGGGGCACAGGCTGTGCTCTCGGCTGCAGCGGCCGCGGGGATTGCATGAACGGTACCTGCCTCTGTGAGATAAGATACTCGGGAGATGAATGCGCCGGTCCCAACTTGCCTTACCACGCCTGTATCGGTGGAGTTTTCCTCATGGTGGCGTTCGTGTGTGCCGTTCAGCTGACTGTGTGCGTGGTGACGGAGTATAAACGTCTGAAAGCCCCAACTATCCTCAGGGCATGTAAGGTCACCACACAGAAGATGCTCTACTTCGTAGCGTTCCTCGCGTCATTCATCCGCGGAGCATACTTTGTGTCACCCTCTGCCTTCCAAGAGGGTTGGGCAACCAGTCTTCTGTCAGCGTACTACCCTCTCATGATATCTGGGTCTTCGCTCATAGTCTGCTTCTGGGCTGAGGTGAGTCTAGACTTAATAACACTTCTTCGCGGAGAATTTCTCAAAGAGACTAAAGTCTGCACGATCATACTACCAAGACCCGGCCCATCTACGGCCGCTGAGGGTGACGACGTTCAAGATACGCTGGTCGCTAAGCAGGGTGTCCAGACAGATTTATCTGATGGCGATGGCAGTATAGACCCTCGATCAGTAAATCTGTCCCAGCTTCATCAGTCGCGGCTGGGGCTGGTCAGTCAGGCTCTCATGTTGATACTCATAGCTGGTTTCCTCGCATCGGAAACTCTCAGCGAATTCTGGAAGACGAAAGTGCCAGTAGTGTCACGTAATTGGCACGACCTGGTGTTCCGTTTGGCGGAGATTGGTGTAGCGCTATGGTTCCCGTGTGTGCTCTGGAACTGTATGGCTCCCGAACGTCTTTGGTTGTTGAACCCGCGCCGGTTGTTGGCGCGCCAGTTGGACGATGCCAGCTTGGCGGATCTATTGGCTAACAAGCGACCGGCTGATGCCAAGCCGGATATGACAGCCGATCTGAATTCCCAGAGCTTAGCCGAGTTTTCTCAACTGTCGTGGCGCGATAAGGCGCTATATGTCTGTCGGCTGGTCCAACATTGGCGATGGTGTCTTTGGACAGTTTTTCTCACCATGTTCGTTTATTCGAAACGCGTGAAATCTATCGCTGGGGCCGCTCCGAATGTCGAGACGGACTCTCTGGTTGGCAGCGTGGGTTCCCACCGCGACTGCTGGATCTGTTACGACAGCTCGCGCCAGGAGCCTCTCATCACGCCTTGCAGGTGTACAGGGGATGTCGCGGCTGTTCATCACGACTGCCTGAGTCGATGGCTCGTGGAGAGCGCTGCGACTCCGGACGGTCTCAAGTGTAAAGTATGCAACACGCCGTACATAGTTCAAGAGACGAACAGGGTTGAATGGGAGCGAGGGTTTACAGCTACACACTGGGTCCGCACAGGTCTTTGTGTGATGGCGATGTGTGGTGCTGGAGGTGCCGCCTGGGTCCTCGTCCAGCTGTTCCCTGCGCCTGTTCCTAGAGTTCTGGCAGCTGGAGCCGCCCTCCTCATATGCTATGTGGCCATCAGGTTTCTCAGCGTGAATACTGTGACAGCGTACCAGCGGGCGAAGGTCTCTTCCCTGCGCATCCTGACTGAACCCGTGGACGCATCAGACACACAACTATCGACTATCAGCAAGACGGTCACAGTTGATATACCGTCCAAGGCGGTACTGGAGCAGGCCTTGAAGGGAGACGTAAAGTAA

Protein sequence:

>DPOGS210777-PA
MRSLLAYYADAVMLAARGTGCALGCSGRGDCMNGTCLCEIRYSGDECAGPNLPYHACIGGVFLMVAFVCAVQLTVCVVTEYKRLKAPTILRACKVTTQKMLYFVAFLASFIRGAYFVSPSAFQEGWATSLLSAYYPLMISGSSLIVCFWAEVSLDLITLLRGEFLKETKVCTIILPRPGPSTAAEGDDVQDTLVAKQGVQTDLSDGDGSIDPRSVNLSQLHQSRLGLVSQALMLILIAGFLASETLSEFWKTKVPVVSRNWHDLVFRLAEIGVALWFPCVLWNCMAPERLWLLNPRRLLARQLDDASLADLLANKRPADAKPDMTADLNSQSLAEFSQLSWRDKALYVCRLVQHWRWCLWTVFLTMFVYSKRVKSIAGAAPNVETDSLVGSVGSHRDCWICYDSSRQEPLITPCRCTGDVAAVHHDCLSRWLVESAATPDGLKCKVCNTPYIVQETNRVEWERGFTATHWVRTGLCVMAMCGAGGAAWVLVQLFPAPVPRVLAAGAALLICYVAIRFLSVNTVTAYQRAKVSSLRILTEPVDASDTQLSTISKTVTVDIPSKAVLEQALKGDVK-