Monarch geneset OGS2.0

DPOGS200134
TranscriptDPOGS200134-TA2511 bp
ProteinDPOGS200134-PA836 aa
Genomic positionDPSCF300128 - 633065-639843
RNAseq coverage380x (Rank: top 31%)
Annotation
HeliconiusHMEL0094400.078.28% 
BombyxBGIBMGA002780-TA0.080.83% 
Drosophilabon-PA2e-0921.28% 
EBI UniRef50UniRef50_F4WYL90.080.04%E3 ubiquitin-protein ligase TRIM37 n=13 Tax=Coelomata RepID=F4WYL9_ACREC
NCBI RefSeqXP_396821.20.076.53%PREDICTED: similar to tripartite motif protein 37 [Apis mellifera]
NCBI nr blastpgi|3287788780.076.53%PREDICTED: hypothetical protein LOC413376 [Apis mellifera]
NCBI nr blastxgi|3227965570.058.61%hypothetical protein SINV_07059 [Solenopsis invicta]
Group
Gene OntologyGO:00055151.4e-21protein binding
GO:00056228.7e-18intracellular
GO:00082702.4e-10zinc ion binding
KEGG pathwayame:4133760.0 
 K10608 (TRIM37, MUL)maps-> Ubiquitin mediated proteolysis
InterPro domain[298-424] IPR0133221.5e-23TRAF-type
[292-423] IPR0089741.4e-21TRAF-like
[143-265] IPR0036498.7e-18B-box, C-terminal
[13-112] IPR0130832.6e-12Zinc finger, RING/FYVE/PHD-type
[104-143] IPR0003152.4e-10Zinc finger, B-box
[295-401] IPR0020831.2e-06MATH
Orthology groupMCL14782 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200134-TA
ATGGCTACAAGAGGTGATAAAGGAAATGGAAATGGCGAGGAACAAATCGTTGAGACGTTGGCAGAGGTGTTCCGGTGCTTCATCTGTATGGAGAAGCTGGTGGACGCCCACCTCTGTCCGCACTGTTCCAAGCTTTGTTGTTATGCTTGCGTGAGGCGATGGCTTACGGAACAAAGATCCCAGTGTCCTCACTGCCGAGCAGCTTTGCATCTTCATGAGTTAGTCAATTGCCGCTGGGTAGAGGAAGTAACACAGCAAATTGAAACCATGCAACAGAGTAATTCAGTGTGCCAAAGAGAAAACTTTAGGGATAGATGCCCGACACATCAAGAGAAGCTAACTGTTTATTGCTGGACTTGTCGCCGCTGTATCTGCCATCAGTGTGCTCTTTGGGGTGGTACACATACTGGCCATACATTCAAACCTCTTGAAGAAGTGTATGAACAGCATGTGACGCAGATCCGAGATGAAGTCTCCCAGCTACGACGTCGTCTATTGGAGCTTATCAGCCTAGTCCAGGATGTTGAAAGGAATGTCGAGTCGGTGCGTTCAGCAAAGGATGAACGTGTCCGTGAAATAAGAAATGCGGTTGAGCTGATGATATCTCGCCTGGACTCGGCGTTGAAAGCCAAGCTGCTGACTCTCATGGGACAGAAGAACAGTCTCACACAGGAAACAGAACAGTTGGAGCATTTGCTGCAGGAGATTGAACATCAGCTACACTCTAGCACCAGATCCGAACTGATAGCGAAGAGCGGCGATCTATCGAAAATGATCCATCAGGTCCGCAAGAAACCGATGGCGAGCTTCGTGACCGCGCCAGTACCGGCCGATTTTCATAGGCAAGTACAGTTCTACAATGAAATCGTGCCGAGCTACGACAGCAGCACCTTCCCGCTGAGCAACTTCACTCAGCTGCAGCACGCGGCGTCGCCCGTGTACTCCGCGCCGCTGCACGTGAACGGTCTGTGCTGGCGGCTCAAGGTATACCCTGACGGTAATGGCGTCGTGCGCGGCAACTATCTGTCCGTGTTCCTCGAGCTCAGCGCGGGTCTGCCGGAGACATCCAAGTACGAATATCGTGTGGAGATGCTGCACCAGGTGTCCCGTGACCCGTCCAAGAACATTGTACGGGAATTCGCTTCGGACTTCGAGGTTGGGGAGTGCTGGGGTTACAATCGCTTCTTCAGATTGGACCTTTTGGCGAGCGAGGGGTATTTGAACCCTGACACCGATACGCTCATCTTGAGGTTCCAAGTCCGCCCGCCGACCTTCTACCAACGCTGCCGCGACCAGCAGTGGTACATCAACCAATTGATAACCATACAGAACCAACACATACTACAGATTAATGATCTCAAAGAGCGTCTAACACTGGAGATGTCCCACAACACTTTGGTGCCGACCCGCGCGACCCCCAACCCGGTGACGTCACAGGCGGACGACAACCCCACACAGAACAACCCGGTGGATGGCAACAGCCTGAGCGACTCGCTCGTCTACAACCAGTGGAAGTTCAACAGTCCACCCAGTGTAATGAGCGGACAGAGGCTGACCAGCCCAGGTATTTTGAACACGGCGTTATTCGAGGATTCGTGTAGCAGCGTGTGTCGCAGCTCGCTGGGGGCGGGCGCCATATGCTACGGGGACTACGGCGGCGAACGTGCCAAGACCACGGCCCAGCACGCGCTCGACTCGCCGTACAACCTGCCTTCAACATCGAGGTCGGCGAGCTCGGTGCAAGCCACGAGCGGCGGCGGGGCGGGCGCGTCCGTGACGGGAGACAAGTTATCGCTGGTGTCACTGCACACACTGCTGAGCGCGGCCGGCGGCGCTCCTCGGCCGCGGGCCAGGCAGAGGCACGACTTCCTCACCGCGGACTCGCCGCTCATCGCGGTGGCGAGTACTCCGGTCAACGAGGTGAACTCGGCCAGTGGGAGCGGTGTGTGCGCGTCGCTGTCGTCACCCGAGCTGAGCGGGGAGGGGGCGAAGGCCGAGGCGGCGCCCGCCAGCCCCGAGCCGCAACCGAACGCCCCCTCGGAGTCCAGCAGCGATACCGGGCAGGACATCATGTTCAGTGAGTTGGATGGGTTCGCAGACGATAACAACCAGAGTCATGTGGAGGAGAACTCCAACGAGGAAAACGACGTGGACGAGGAGACCATGTCCGCAGCCATCACACGAAAAGACAATATGCATCCTAGTGAACCCCGTTTCATCCCATCCTATCCTATTCTATCCCATCCCTTGAATGTATCCTCCCGCCGCTCCTCAGAGGGGTCGTCCGGTAGCGGGTGCTCCTGGTGTGCGCTGGACGAGCGGCCTGGGTCCCCGGCGTCCCCTCCGTCTCCCCTGTCCCCATCCCCCGCGGGGTCCCTCTGCTCGGAGGCTCCGGTCCGCCGCCGCCTGCGACGCGCGGCGCCCTACCGCCGCTCCCGCCCGGCCTGTCCGCCAGCGGAGCCCTCGCCCTCCGCACACGTCGGATGCGTCGCCTTATATCGCATATCCTAG

Protein sequence:

>DPOGS200134-PA
MATRGDKGNGNGEEQIVETLAEVFRCFICMEKLVDAHLCPHCSKLCCYACVRRWLTEQRSQCPHCRAALHLHELVNCRWVEEVTQQIETMQQSNSVCQRENFRDRCPTHQEKLTVYCWTCRRCICHQCALWGGTHTGHTFKPLEEVYEQHVTQIRDEVSQLRRRLLELISLVQDVERNVESVRSAKDERVREIRNAVELMISRLDSALKAKLLTLMGQKNSLTQETEQLEHLLQEIEHQLHSSTRSELIAKSGDLSKMIHQVRKKPMASFVTAPVPADFHRQVQFYNEIVPSYDSSTFPLSNFTQLQHAASPVYSAPLHVNGLCWRLKVYPDGNGVVRGNYLSVFLELSAGLPETSKYEYRVEMLHQVSRDPSKNIVREFASDFEVGECWGYNRFFRLDLLASEGYLNPDTDTLILRFQVRPPTFYQRCRDQQWYINQLITIQNQHILQINDLKERLTLEMSHNTLVPTRATPNPVTSQADDNPTQNNPVDGNSLSDSLVYNQWKFNSPPSVMSGQRLTSPGILNTALFEDSCSSVCRSSLGAGAICYGDYGGERAKTTAQHALDSPYNLPSTSRSASSVQATSGGGAGASVTGDKLSLVSLHTLLSAAGGAPRPRARQRHDFLTADSPLIAVASTPVNEVNSASGSGVCASLSSPELSGEGAKAEAAPASPEPQPNAPSESSSDTGQDIMFSELDGFADDNNQSHVEENSNEENDVDEETMSAAITRKDNMHPSEPRFIPSYPILSHPLNVSSRRSSEGSSGSGCSWCALDERPGSPASPPSPLSPSPAGSLCSEAPVRRRLRRAAPYRRSRPACPPAEPSPSAHVGCVALYRIS-