Monarch geneset OGS2.0

DPOGS210007
TranscriptDPOGS210007-TA1194 bp
ProteinDPOGS210007-PA397 aa
Genomic positionDPSCF300327 - 15726-18281
RNAseq coverage4x (Rank: top 89%)
Annotation
HeliconiusHMEL0129538e-1423.78% 
BombyxBGIBMGA008388-TA4e-4844.55% 
DrosophilaCG43088-PA1e-1222.56% 
EBI UniRef50UniRef50_UPI000206043F1e-9043.25%UPI000206043F related cluster n=2 Tax=unknown RepID=UPI000206043F
NCBI RefSeqXP_001946870.19e-9845.27%PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum]
NCBI nr blastpgi|3286977752e-9644.75%PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3286977756e-9545.50%PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00167881.2e-10hydrolase activity, acting on ester bonds
KEGG pathway 
InterPro domain[146-342] IPR0069121.2e-10Putative harbinger transposase-derived nuclease
Orthology groupMCL10304 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210007-TA
ATGGTTGACTGGGATTTAGTATTAATAGCACTTCTAGCTGAAGATGAAGAGAGACAAATAAGAACTGCAAATTTGACGAAAAGACAATTCTGGGTGCATAATTTATGGAGAACAAGATCTACAAATGGTGAATTCAGCAACTTGTTTAATGAGTTGAGATACGACTTGCGAAAGTTCTATGATTATTACAGAATGGACTATGAAAAATTTGAAAAGTTAAGTTATTTGCTAAAATCGCATATAAAGAAAATTAAAACTAATTTTCGATCGCCTATACCTGTTACAGAGAGATTGTCTGTATGTTTAAGGTATTTAATTACAGGAGCTTCTTTTAAGTCTTTAGCTTTTAGCTATCGTATGGGATTCACTACTGTGCGTAATATAGTCCACGAAACCTGTCAAGCCATATTTACTGTACTAAGGTCTACTGCATTGCCGAAACCCACGAGCCAACAATGGCAGTCAATTGCAACCGACTTTGATAAATTTTGGAATTTTCCTAACTGCATTGGTGCCATAGATGGTAAGCATTTCAAGATTAGAGCTCCTAATAACAGTGGAAGTATGGATATCAACTACAAGAAGTTCTTTAGCATCGTTTTATTAGCCGTCGTGGATGCAAAATACAAATTTGTGATTGTAGATGTCGGAGCATATGGTCGAAATAGCGATGGCGGTACACTTCAAAGCTCTAAATTTGGTTCGAAACTACGTAATGGTTTTTTATGCATACCACCGGAAAAGGCTTTACCACATTCAACTCAAAAATTACCTAACGTATTTGTCGCCGATGAAGCCTTTCCACTAACAGAAAATATTATGAGACCATACCCCTCACATCTTTTAAATGATGAAAAAAAAAGGATATTCAATTATCGCTTAAGTAGAGAACGGCGTATTGTAGAAAATGCTTTCGGAATGTTACAAGAAAGATTTGAATTATTTCAAAAAGGTATAAAAGTTCAACCAAAGTACATTAACAACATTATACTAGCAAGCACGTGTTTGCATAACTTTATCATTGATGGTCATTCTATGGACGCCCTTAGCTCTAATACTAATAATGCAATCGATAGAACCAATGATTCAGTTTTTAACAATCTGGATGGGATGGTTGTAAGAGATTGTTTCACGGAATATTTTTCTAATGTTGGTAACGTGTACTGGCAAAATGAAATTGTTAATAGATGTTAG

Protein sequence:

>DPOGS210007-PA
MVDWDLVLIALLAEDEERQIRTANLTKRQFWVHNLWRTRSTNGEFSNLFNELRYDLRKFYDYYRMDYEKFEKLSYLLKSHIKKIKTNFRSPIPVTERLSVCLRYLITGASFKSLAFSYRMGFTTVRNIVHETCQAIFTVLRSTALPKPTSQQWQSIATDFDKFWNFPNCIGAIDGKHFKIRAPNNSGSMDINYKKFFSIVLLAVVDAKYKFVIVDVGAYGRNSDGGTLQSSKFGSKLRNGFLCIPPEKALPHSTQKLPNVFVADEAFPLTENIMRPYPSHLLNDEKKRIFNYRLSRERRIVENAFGMLQERFELFQKGIKVQPKYINNIILASTCLHNFIIDGHSMDALSSNTNNAIDRTNDSVFNNLDGMVVRDCFTEYFSNVGNVYWQNEIVNRC-