Monarch geneset OGS2.0

DPOGS211319
Transcript: DPOGS211319-TA, 1401 bp
Protein: DPOGS211319-PA, 466 aa
Genomic position: DPSCF300125 + 72763-78359
RNAseq coverage: 301x (Rank: top 37%)
Annotation
Heliconius       HMEL009364         0.0      92.92%
Bombyx           BGIBMGA004947-TA   7e-157   80.23%
Drosophila       CG12333-PA         3e-161   59.40%
EBI UniRef50     UniRef50_Q7QA84    1e-159   62.58%   AGAP004374-PA n=3 Tax=Pancrustacea RepID=Q7QA84_ANOGA
NCBI RefSeq      XP_001601046.1     6e-172   65.26%   PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp   gi|307199085       9e-172   64.11%   WD repeat-containing protein 37 [Harpegnathos saltator]
NCBI nr blastx   gi|307199085       1e-167   64.25%   WD repeat-containing protein 37 [Harpegnathos saltator]
Group
Gene Ontology     GO:0005515             2e-58     protein binding
KEGG pathway      bfo:BRAFLDRAFT_59218   7e-19
                  K01062 (E3.1.1.47, PAFAH) maps-> Ether lipid metabolism
InterPro domain   [129-461]   IPR015943   2e-58     WD40/YVTN repeat-like-containing domain
                  [127-459]   IPR011046   1.3e-56   WD40 repeat-like-containing domain
                  [174-211]   IPR019781   1.9e-08   WD40 repeat, subgroup
                  [331-369]   IPR001680   3.9e-07   WD40 repeat
                  [157-171]   IPR020472   4e-06     G-protein beta WD-40 repeat
Orthology group   MCL13508    Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211319-TA
ATGAAATCTAGTAAAAGACGACTTCAAGCTTTAACCGAGAGTTCAGATGGACTACCAAACTATTTAAAAATGGAAGATTCAGAATCATCAATTCCACCAGTTTTTAGGTGCCGCCTCCATGAATTGTTTTCACAAATAGAAAAAGAATTTGATGTTTTGTACACTGAAAACTTAAACTTACAAGAAAAAATAGACATTCTTAGTGAAAAATTAGAAAGAGAAAGCTATGTTGGTGACAGACAGATTGTGGACTATGTTGATTTTGAAACATCAGGAAAAAACTCTAAGGTTAAATTATCTCAGAGTAATTCTCAAAAAGTAAAAACCAGTCACAAACTAAAAGTACAGACCAGCAAGATTGTGTCTAGCTTCAAAGCCCCTACATATAGCTGTCAGCTTATCAGGGAGTTCACTGGGCATAAGGATGGTATCTGGGACGTAACCACTGCACGTCCCGGACAGGCGCTGATTGGAACTGCTTCAGCAGATCACACAGCGTGTGTTTGGAGTGTTGAATGGGGTAAGTGTCTGCTTCAATACACCGGCCATGCTGGATCTGTGAACTCTATAAGATTTCATCCTACCAGAGACATAGCACTTACAAGCAGCGGTGACAACACCGCACATGTTTGGCAGGCAGCTGTGAATTGGGATCTGCCGCGTGGTCAGTCGTCCGAAGAAGAATTAGACGGTGGTGGCGAGGAAAGCTTAGGAGAAAGCGATAGACCAGAAGTATTAAGGACTCCGTTGACTGAGCTCAGTGGTCATATGGGAGTGGTTGTCGCAGCTGATTGGCTGACAGGAGGCGATCACGTCATCACCGCATCCTGGGATAGGACGGCCAATCTGTATGACGTCGAAACCGGAGACTGCTTGCAAATATTGACAGGTCATGACCATGAACTGACACATGCGTCATCTCACCATAGCTCCCGTCTCGTGGTGACGGCTTCCCGTGACACCACCTTCAGACTGTGGGATTTTCGTGAACCGATTCACTCCGTGTCCGTTTTCCAAGGGCATACTGAGAGCGTTACTTCAGCGGTTTTCACAAGGGAGGACAAAGTTGTTTCCGGTTCCGACGATAGATCTGTGAAGGTTTGGGATGTTCGTAACATGCGTTCAGCTCTGGCCACCATACGTTCAGATTCATCAGTGAACCGCGTCTCCGTGAGCTCGGGTGGTCTGATAGCGATACCCCACGACAACCGACAGGTGCGACTGTTTGACCTTCAGGGTCAAAGACTGGCCAGGCTGCCGAGGTCGAGTAGACAGGGTCATCGTCGCATGGTGACCTCAGTGTCATGGGTTGAGGATGTTGCTTCCAATATGAATTTCTTCAGCTGTGGCTTCGACCGTCGCATACTGGGCTGGTCCATACAGCCATCTAAGGACAACTGA

Protein sequence:

>DPOGS211319-PA
MKSSKRRLQALTESSDGLPNYLKMEDSESSIPPVFRCRLHELFSQIEKEFDVLYTENLNLQEKIDILSEKLERESYVGDRQIVDYVDFETSGKNSKVKLSQSNSQKVKTSHKLKVQTSKIVSSFKAPTYSCQLIREFTGHKDGIWDVTTARPGQALIGTASADHTACVWSVEWGKCLLQYTGHAGSVNSIRFHPTRDIALTSSGDNTAHVWQAAVNWDLPRGQSSEEELDGGGEESLGESDRPEVLRTPLTELSGHMGVVVAADWLTGGDHVITASWDRTANLYDVETGDCLQILTGHDHELTHASSHHSSRLVVTASRDTTFRLWDFREPIHSVSVFQGHTESVTSAVFTREDKVVSGSDDRSVKVWDVRNMRSALATIRSDSSVNRVSVSSGGLIAIPHDNRQVRLFDLQGQRLARLPRSSRQGHRRMVTSVSWVEDVASNMNFFSCGFDRRILGWSIQPSKDN-
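
The transcript and protein lengths above are consistent with each other: 466 codons plus one stop codon give 3 x 467 = 1401 bp. The following is a minimal sketch of how that relationship, and the translation itself, could be checked. It assumes the standard genetic code and that the trailing "-" in the protein record marks the stop codon; the function names `translate` and `expected_cds_length` are hypothetical helpers, not part of any database tooling. Only the first and last few codons of DPOGS211319-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumption: the record
# uses the standard code; "*" marks stop codons in this table).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the "-" that
    ends the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length):
    """Length in bp of a CDS encoding protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)


# First 30 nt and last 15 nt of DPOGS211319-TA, copied from the record.
first_codons = "ATGAAATCTAGTAAAAGACGACTTCAAGCT"
last_codons = "TCTAAGGACAACTGA"

print(translate(first_codons))   # start of the protein record
print(translate(last_codons))    # end of the protein record, including the stop
print(expected_cds_length(466))  # transcript length implied by 466 aa
```

Passing the full nucleotide sequence to `translate` in the same way should reproduce the complete protein record, provided the assumptions above hold.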