Monarch geneset OGS2.0

DPOGS210122
TranscriptDPOGS210122-TA2316 bp
ProteinDPOGS210122-PA771 aa
Genomic positionDPSCF300017 + 1497440-1505211
RNAseq coverage390x (Rank: top 31%)
Annotation
HeliconiusHMEL0107010.074.74% 
BombyxBGIBMGA000230-TA8e-10863.21% 
DrosophilaPlap-PA9e-15038.77% 
EBI UniRef50UniRef50_UPI00022C91B83e-16340.68%UPI00022C91B8 related cluster n=1 Tax=unknown RepID=UPI00022C91B8
NCBI RefSeqXP_392743.21e-16540.61%PREDICTED: similar to phospholipase A2, activating protein [Apis mellifera]
NCBI nr blastpgi|3504021751e-16240.68%PREDICTED: phospholipase A-2-activating protein-like [Bombus impatiens]
NCBI nr blastxgi|3504021758e-16340.68%PREDICTED: phospholipase A-2-activating protein-like [Bombus impatiens]
Group
Gene OntologyGO:00055151.2e-61protein binding
KEGG pathwayame:4092193e-165 
 K14018 (PLAA, DOA1, UFD3)maps-> Protein processing in endoplasmic reticulum
InterPro domain[4-296] IPR0159431.2e-61WD40/YVTN repeat-like-containing domain
[4-296] IPR0110461.7e-59WD40 repeat-like-containing domain
[512-758] IPR0135353.1e-39PUL
[342-452] IPR0151557.9e-37PLAA family ubiquitin binding, PFU
[220-258] IPR0197811.7e-08WD40 repeat, subgroup
[219-258] IPR0016809.2e-08WD40 repeat
Orthology groupMCL11501 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210122-TA
ATGGCTATTCCTGAATATAAACTAAGTTCTGTTTTGTGTGGCCATTCCATGGACGTTAGATGTGTAGCCACAACAAAAGAATCTTGTATACTTTCTGCTTCCCGAGATAGGACTGCTAGATTATGGCACCCGGAAGGGACTAAAGATTTCGTAAACGTAGTGACCTATAAAGGTCACGATAACTTCGTTTCTTGTGTTTGCTGGCTTCCACCTTGCGAAGCATTTCCTGAAGGACTGGTTGTTACTGGAAGCAATGATAATACTATACTAGGATATAATCTTCAGGATGGTGCAATTCAGATACATTTGAAAGGTCATAATAATGCAGTGTGTAGTGTTGCTCCAGGAAATGATTCAGGAATACTTTTGAGTGCAAGTTGGGACAACACATCCAGGATATGGAATATCAACTCCCCACAAATGTCACCAGTCGTACTCAAAGGTCACCAAGCTGCTGTGTGGTGTGTTATAGAATTGAGTAATGGAGTGTATGCCACTGCTTCGGCTGATAAAACTATCAAACTGTGGAGGAAAGATGGAGCTTTGATCAATACACTTTCTGGTCATACGGACTGTGTGAGAGGCTTGACAATAGCAAGTTCAGAAAGTTTCCTAAGTTGTTCCAATGACGCATCCATCAAGTTGTGGTCGAACAAAGGAGATTGCATCAACACATATTACGGACATTCTAATTATGTTTATGGTATAAGCAGCAATCCTGAAAGCGGTATGTTTGCTAGTTGTGGAGAAGATGGCGCATTGCGTCTTTGGAGCGGTACTGAGAGTATTGCACTAAGGCTGCCCGCACACTCTGTCTGGAGTATAGCCTGCCTTAATAACGGAGATGTTGTCACTGGTTCTAGTGATGGCATCATCAGAGTGTTTACAAAGGATCCAGTGAGATTCGCTGACGAGACAACACTGAAAAACTATGAGGAAGATTGTAAGAAAATGATTGAAGCGTCACAACAAGAAATAGGAGGTTTTAAACTCTCAGAACTTCCCGGACCGGAAGTTTTACTGGAGCCTGGCCGCACGGATGGTCAGACGAAGTTGGTGCGGAGAGGGGCCTCAGTGAAATGTTACGCGTGGCGCGCCGCGGGGGGAACGTGGGAGGAGCTTGGTGACGTCATGGGATCCACGCCCCCCACCCAAGGGAAAACCATGTACCAAGGACAGGAGTATGACTTCGTGTTCAGCGTTGACATCAAGGACGGTGCTCCGCCCATCAAGCTGCCATTCAACAAGACGGAGGACCCGTGGGTCGCGGCACAGGCGTTCATACACAAACACGAGCTGCCACAAGTATATTTAGAACAAGTCGCTAACTTTATAATAACGAATGCTAAGTTAGACTCCGTCCCAGCTTCTAGTAACGGGTACGCGGATCCGTTTACAGGAGAGTCTCGTTACGTGCCGAGTTCAGCTTCCCCGGCGGGCCCTACCGGGGGGCTCCCTACTGTTTCCTCGGGGCCCCTCAAAGACCCCTTCACTGGCGAAGGCGCCTACACAACCTCCAGCAATGAGAAACCCCTCATACCTCACGATGCATACATCAGGTTTGATGCGGCAAATCTTAAAGCTATACATGACAAACTAAAAGAGTTCAACAGTAAAGTGGGAGACGGTTTGAACGCGTTCACAGACGAACAGATTGAAAATATTGTGAAGTTAGGAGAAATGGACTGCACTTTCAATCCGGAAACCGTAACCCTGCTTAAGAAAATGCTAGAATGGCCCAAAGAAATTCTGTTCCCTGTACTCGACGTCACTAGATTGGCCGTAAGAAACAAAGATATCAACACTCAAATATTTGACACAACATATGGGCCAAACTTCGTTAAATATCTGCTGACATTGTTAAGTCCAGATAATCTGTCACCCAACCAGTTACTCTCTATACGTGTGTTAGTGAATGCGTTCAGTGCTCTGTCCGGCGAGATGCTAGTACTGTCAGCTCGTGAAAGACTTCTGGAAACTATGAACATGCTCACAAACATCAGTAACAACGCCCAGATAGCCGCTATGTCATTACTCCTTAACTTGTCGGTAGCTCTTTGTCAGCAGCCAGATAATATAGACCTAGCAGATTCTGTTGTTAATTTACTCAACAAAATAACAGATAATGAGGCTTACTTCAGAGGTCTTGTTGCATTAGGCACTTTATTAGCGGAATCTCCAAACAAACTTATCATACAGACGAAGATTGTAACCAGCAATAATCTACATAACAGGTTGAAAAGAGACAGTTCCACAGAGATTCCCAACTTCAAGAAAATATCAATTTGTTCCCAACAAATATTAAGACTGTTATGA

Protein sequence:

>DPOGS210122-PA
MAIPEYKLSSVLCGHSMDVRCVATTKESCILSASRDRTARLWHPEGTKDFVNVVTYKGHDNFVSCVCWLPPCEAFPEGLVVTGSNDNTILGYNLQDGAIQIHLKGHNNAVCSVAPGNDSGILLSASWDNTSRIWNINSPQMSPVVLKGHQAAVWCVIELSNGVYATASADKTIKLWRKDGALINTLSGHTDCVRGLTIASSESFLSCSNDASIKLWSNKGDCINTYYGHSNYVYGISSNPESGMFASCGEDGALRLWSGTESIALRLPAHSVWSIACLNNGDVVTGSSDGIIRVFTKDPVRFADETTLKNYEEDCKKMIEASQQEIGGFKLSELPGPEVLLEPGRTDGQTKLVRRGASVKCYAWRAAGGTWEELGDVMGSTPPTQGKTMYQGQEYDFVFSVDIKDGAPPIKLPFNKTEDPWVAAQAFIHKHELPQVYLEQVANFIITNAKLDSVPASSNGYADPFTGESRYVPSSASPAGPTGGLPTVSSGPLKDPFTGEGAYTTSSNEKPLIPHDAYIRFDAANLKAIHDKLKEFNSKVGDGLNAFTDEQIENIVKLGEMDCTFNPETVTLLKKMLEWPKEILFPVLDVTRLAVRNKDINTQIFDTTYGPNFVKYLLTLLSPDNLSPNQLLSIRVLVNAFSALSGEMLVLSARERLLETMNMLTNISNNAQIAAMSLLLNLSVALCQQPDNIDLADSVVNLLNKITDNEAYFRGLVALGTLLAESPNKLIIQTKIVTSNNLHNRLKRDSSTEIPNFKKISICSQQILRLL-