Monarch geneset OGS2.0

DPOGS202770
Transcript        DPOGS202770-TA  1917 bp
Protein           DPOGS202770-PA  638 aa
Genomic position  DPSCF300018 - 1083811-1085995
RNAseq coverage   425x (Rank: top 29%)
Annotation
Heliconius      HMEL002689        8e-92  59.25%
Bombyx          BGIBMGA010493-TA  3e-93  67.76%
Drosophila      CG15141-PA        6e-57  42.80%
EBI UniRef50    UniRef50_E3WTS3   2e-65  52.08%  Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WTS3_ANODA
NCBI RefSeq     XP_001197926.1    3e-63  55.07%  PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
NCBI nr blastp  gi|312381106      6e-65  52.08%  hypothetical protein AND_06667 [Anopheles darlingi]
NCBI nr blastx  gi|312381106      1e-66  47.87%  hypothetical protein AND_06667 [Anopheles darlingi]
Group
Gene Ontology    GO:0008270  1.6e-13  zinc ion binding
                 GO:0004842  1.6e-13  ubiquitin-protein ligase activity
KEGG pathway
InterPro domain  [26-98]    IPR003126  1.6e-13  Zinc finger, N-recognin
                 [106-189]  IPR011011  6.3e-13  Zinc finger, FYVE/PHD-type
                 [25-100]   IPR013993  1.9e-12  Zinc finger, N-recognin, metazoa
                 [112-173]  IPR013083  1.6e-09  Zinc finger, RING/FYVE/PHD-type
Orthology group  MCL13965  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202770-TA
ATGATGGACGTCTTGCAGGAACAAGAAAATTTTGAAGAAGATGCTAACGCTGTGTTGGGAGGTTCCGATGATAAAAACTGTACATATTCTAAGGGCTACATAAAGAGACAAGCATTGTACGCCTGCATGACCTGCTGCTCTGAAGCGAAATCTGACCCAGCTAAGAGAGCAGGTCTTTGTCTAGCCTGCAGCCTCACTTGTCATGAAAATCATGAACTTATAGAACTGTATACTAAACGCAATTTTAGATGTGACTGCGGTAACTCAAAATTCAACTCTAATCCTTGTCAGTTAGCACCCAAAAAAGCAAATTTTAATGAGGAGAATAGTTACAACCAGAATTTTAGTGGAGTATACTGTGTGTGTCGGAGGCCATACCCCGATCCAGATTGTGAAACTGAAGATGTAATGATCCAATGTACCATATGCGAGGACTGGTACCACGGCACACATTTAGAAACAACTGTCCCTAACAGTGAACTCTACACAGAGATGATTTGCAAAGGATGTATGGAAAAATATGACTTTTTACATTCCTACAGTTACATGGTTGTAAATGTTGAAAGCTCCGATGTTGATGTCATTAATGTTCCTGAGAATGGAATTAAAACTCGCAATGGAGACTTCAAAACAGATGCCACAGCTGTTGAAGATAGTGAAAGGTCTCAGGAAAATGAAGATATTAGCCTCACTCCTAAAAAAGAAATTTCTTCTATTGATGAAAACATTGAAGAAAATAAAAAGAAAGTAGAAAATGATGGAACAAATAATTCTAAGATGGAAGGTATTTCTGATGTTGATGTTAGTGTTGAGAATCCTTCAAGTGAAAGTCTCATAAGTTGTAACAATAAAGACAACACTGATATTAAAGAGGAGGGAAGCAATGCAGACAATACCAATGACCGAGATACTAGTAGTGACCAGAGCCAAGATATTATCAATAGTGAGATCCAACGAGACATGGAACTAAACAAAAATAATACTGCAAAAGAAATCGAAGACAAAAATGCTATGAAAACAACTAGCGAGGTGAAAAATAGACAAGATGAAAATACAGATGAAGAGAAGCCTCTGGTAGATTATGAAAGTGAAAGCTGCAAGTCAAAAATGAATTTAGATATCACAGAAACAAGTCAGGAAGATATTAAGACTACACATGAGAACGGGAAAATGTATAAAAAAAATGATGCTGACAATACTATCAACAGAACAAGTGAGTTAAATAACTTAGAAAAAGGAGAAGGGACAGAGGAAAAAACGGAAAACTTGGCATCTCATGATGAAAAAGATGTGGTACAAAACACATCAAAAGGAAAGGAAACAAAAAATGAAGGTGGTGGTAACTGTAGCAATCCTGTTGATGGTAATTATACAGACGCTGCTACTGATGAAGTAACAGGGGACAGCAATCACAAAGGATCAGAAAAAAGAAAACTTTCCACAGAAGAAACAACAGATAGTTCAGTGAGTAAGAAAAGTAAATTAGGAGAGGTGACTGACAAACCATGCACTTGTCCTAAAAATGACAAAAAAGTGTACAGAGGAGCAACATTCTGGCCCTCAACCTTCCGCCAGAGACTCTGCACATGCAATGAATGTCTGAGCATGTATAAGGACCTGTCTGTTATGTTCCTTATGGACACTGAAGACACAGTCGTCGCCTACGAGAGCTTGGGCAAGGAGAAAACCAACGGTAAGCCATCACAGTATGAAAAGGGGCTCCAAGCACTTTCATCGCTGGATAGAATCCAACAGATCAATGCCTTGACAGAGTACAACAAAATGAGAGACAAGCTATTAGACTTCCTTAAAAGCTTCAAGGACAGGAAAGAAATTGTCAAGGAGGAAGACATCAAAGCATTCTTTGCCGGAATGAAGCCCAAGAGGGAACCAGAGGGTGTGTACTTTTGTCGGTGA

Protein sequence:

>DPOGS202770-PA
MMDVLQEQENFEEDANAVLGGSDDKNCTYSKGYIKRQALYACMTCCSEAKSDPAKRAGLCLACSLTCHENHELIELYTKRNFRCDCGNSKFNSNPCQLAPKKANFNEENSYNQNFSGVYCVCRRPYPDPDCETEDVMIQCTICEDWYHGTHLETTVPNSELYTEMICKGCMEKYDFLHSYSYMVVNVESSDVDVINVPENGIKTRNGDFKTDATAVEDSERSQENEDISLTPKKEISSIDENIEENKKKVENDGTNNSKMEGISDVDVSVENPSSESLISCNNKDNTDIKEEGSNADNTNDRDTSSDQSQDIINSEIQRDMELNKNNTAKEIEDKNAMKTTSEVKNRQDENTDEEKPLVDYESESCKSKMNLDITETSQEDIKTTHENGKMYKKNDADNTINRTSELNNLEKGEGTEEKTENLASHDEKDVVQNTSKGKETKNEGGGNCSNPVDGNYTDAATDEVTGDSNHKGSEKRKLSTEETTDSSVSKKSKLGEVTDKPCTCPKNDKKVYRGATFWPSTFRQRLCTCNECLSMYKDLSVMFLMDTEDTVVAYESLGKEKTNGKPSQYEKGLQALSSLDRIQQINALTEYNKMRDKLLDFLKSFKDRKEIVKEEDIKAFFAGMKPKREPEGVYFCR-
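
The lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the transcript consists of the coding sequence plus one stop codon (the nucleotide sequence ends in TGA and the protein ends in "-"), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1917
PROTEIN_AA = 638
GENOMIC_START = 1083811
GENOMIC_END = 1085995


def coding_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


def genomic_span_bp(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


expected_bp = coding_length_bp(PROTEIN_AA)
span_bp = genomic_span_bp(GENOMIC_START, GENOMIC_END)

# True if the listed transcript length matches 3 nt per residue plus a stop codon.
print(expected_bp == TRANSCRIPT_BP)
# Genomic span, and how much longer it is than the transcript.
print(span_bp, span_bp - TRANSCRIPT_BP)
```

Under these assumptions the genomic span is longer than the transcript. Intronic sequence would explain the difference, but the record does not list the exon structure, so that reading is a guess.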