Monarch geneset OGS2.0

DPOGS212751
Transcript        DPOGS212751-TA  1344 bp
Protein           DPOGS212751-PA  447 aa
Genomic position  DPSCF300012 + 662854-668348
RNAseq coverage   283x (Rank: top 39%)
Annotation
Heliconius      HMEL014156        2e-98   84.40%
Bombyx          BGIBMGA013202-TA  6e-119  78.39%
Drosophila      CG5555-PA         2e-85   34.92%
EBI UniRef50    UniRef50_Q9VDZ1   2e-83   34.92%  CG5555 n=14 Tax=Diptera RepID=Q9VDZ1_DROME
NCBI RefSeq     XP_001994664.1    3e-87   34.25%  GH14868 [Drosophila grimshawi]
NCBI nr blastp  gi|195055518      5e-86   34.25%  GH14868 [Drosophila grimshawi]
NCBI nr blastx  gi|189238256      7e-85   48.58%  PREDICTED: similar to BRCA1-associated protein [Tribolium castaneum]
Group
Gene Ontology   GO:0005515  5.7e-07  protein binding
                GO:0008270  5.7e-07  zinc ion binding
KEGG pathway 
InterPro domain  [80-196]   IPR011422  7e-27    BRCA1-associated 2
                 [205-253]  IPR013083  2e-08    Zinc finger, RING/FYVE/PHD-type
                 [213-252]  IPR001841  5.7e-07  Zinc finger, RING-type
Orthology group  MCL14177  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212751-TA
ATGTCTGTATCTTTGTGTGTTTTACGAATTGAAATTGACGAAGATCTGCCAGAGATGGCTAATAATCTTGCCGAAGTCGAACATGCCCAAGAACAAAAGAAGAAGGATCGCGGGGCTCGGGAATCAAAAACTATCACAGTAGAAACATATGCTAGTGTAGTTTCTGGTCGATCTTCCGGAACCGGTATACTGCCAACCAGCTCGCGCAGATCTCCACCTGAGGCTAGCAATGACCGAGAAACCATCAGCTTCTTCTGTGGAAATCCCCTCGTTGAAGTTACAAAGGGGGTCTTACATTTGTATAAAGAAAATGAGTTAAAAGAAACTGAAGAGGCCAAGACCCTCTGTCTCCTGTCGGTTCCCGGTGCGCTAGGAGCTGCTGATCTCTTAGCGTTTGCGGCTGCATGCCAACAGGACATTTGTCACGTGAGAGTACTAAGAGACGGCTCGCCAGATCATTACATGGCTTTGTTTACATTCCGCACGTACGATGCTGCCCGTGAGTTCCACACAGCTTTTTCGGGTGTTCCGTACAGTAGCCTGGAACCTCAGGCCCTGTGCCACGTGGCTTGGGTCTCCAGGGTTGAGTATGCTCGCTCGGGGACTCCGCCACCCGCACACACCGAGCTGCCCACCTGTCCCGTCTGCCTCGAGCGTATGGACGAGAGCGTGGCTGGTGTCTTGAGTGTACAGTGCTCGCACTCCTTCCACGCGGACTGTCTCGTACGTTGGAGTGACGCCCGCTGTCCTGTCTGCCGCTGTGCACAAACACCAGAGCCAAGGGAGAGAGCTGTCTGTCTTCAGTGTGAAATCGAGGGAGGGCCTGTGGAGGAGGGGGGGCCCATGGGTGAGGGGGAGGGAGCAGGGGAGGCGCTGGAAGCGTATGGCTCGCTCTGGATCTGTCTGATATGTGGACATGTCGGCTGTGGAAGACTGGAAATTGAACACCGTGCCACGACCGCGGCTGGTGTACGTCGAGCTGAGACCGCCGAGTCAGAGGTCAGCGAACTTAGGTTGAAACTTAGCATTCTCACCAGGGAACACGTGGCCGTCGAGAGGCGGATGCAGAGCCTAGCCAATAAGGTAACGACACTACAAACTGAGCTGCAAGAAGAGCGCGGGCTGGCTTCAGCGCTGGCGTCCTCGCACAAACAGTGGCAGGAACAGGCGAGGAAAACGGAGGAGGCTCTCACTAAAGAGGTGAGCGAGCTGAAGGAAGAACTCAGGGATGTGATGTTCTTCGTGGAGGCGCGTACGTCGCTGGCAGCCGCAGCGCCAGCCGAGGAATTAGCTGAAGCTACAGTCACCGTCGCACCGGCCAAGCCGAGGCGCAAGCGGCGATAG

Protein sequence:

>DPOGS212751-PA
MSVSLCVLRIEIDEDLPEMANNLAEVEHAQEQKKKDRGARESKTITVETYASVVSGRSSGTGILPTSSRRSPPEASNDRETISFFCGNPLVEVTKGVLHLYKENELKETEEAKTLCLLSVPGALGAADLLAFAAACQQDICHVRVLRDGSPDHYMALFTFRTYDAAREFHTAFSGVPYSSLEPQALCHVAWVSRVEYARSGTPPPAHTELPTCPVCLERMDESVAGVLSVQCSHSFHADCLVRWSDARCPVCRCAQTPEPRERAVCLQCEIEGGPVEEGGPMGEGEGAGEALEAYGSLWICLICGHVGCGRLEIEHRATTAAGVRRAETAESEVSELRLKLSILTREHVAVERRMQSLANKVTTLQTELQEERGLASALASSHKQWQEQARKTEEALTKEVSELKEELRDVMFFVEARTSLAAAAPAEELAEATVTVAPAKPRRKRR-
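
Length check:

The reported lengths in this record are mutually consistent if the transcript is the coding sequence including its stop codon: 447 codons plus one stop codon give 3 x 447 + 3 = 1344 bp. The Python sketch below shows one way such a check could be run on FASTA text like the two records above. The function names are hypothetical and not part of any database tooling, and the trailing "-" in the protein record is assumed to mark the stop codon rather than a residue.

```python
def fasta_lengths(text, stop_marker="-"):
    """Parse FASTA text into {record_id: sequence_length}.

    Assumption: stop_marker ("-") marks a translated stop codon and is
    not counted as a residue.
    """
    lengths = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first token after ">".
            current = line[1:].split()[0]
            lengths[current] = 0
        elif current is not None:
            lengths[current] += len(line.replace(stop_marker, ""))
    return lengths


def expected_cds_length(protein_aa, include_stop=True):
    """Coding-sequence length in bp for a protein of protein_aa residues."""
    return 3 * protein_aa + (3 if include_stop else 0)


def is_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals the codons plus a stop codon."""
    return transcript_bp == expected_cds_length(protein_aa)


# Lengths as reported in the record above.
print(is_consistent(1344, 447))  # True
```

To check the sequences themselves rather than the reported lengths, pass the text of both FASTA records to `fasta_lengths` and compare the resulting `DPOGS212751-TA` and `DPOGS212751-PA` entries with `is_consistent`.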