Monarch geneset OGS2.0

DPOGS202119
TranscriptDPOGS202119-TA1752 bp
ProteinDPOGS202119-PA583 aa
Genomic positionDPSCF300150 + 319393-322655
RNAseq coverage199x (Rank: top 47%)
Annotation
HeliconiusHMEL0023860.077.18% 
BombyxBGIBMGA006962-TA0.067.64% 
DrosophilaCG2260-PA2e-15247.89% 
EBI UniRef50UniRef50_G6DF450.0100.00%Putative uncharacterized protein n=2 Tax=cellular organisms RepID=G6DF45_DANPL
NCBI RefSeqXP_971200.11e-18059.63%PREDICTED: similar to CG2260 CG2260-PA [Tribolium castaneum]
NCBI nr blastpgi|910778322e-17959.63%PREDICTED: similar to CG2260 CG2260-PA [Tribolium castaneum]
NCBI nr blastxgi|910778321e-17456.33%PREDICTED: similar to CG2260 CG2260-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055157.8e-34protein binding
KEGG pathwayptm:GSPATT000059000011e-06 
 K13341 (PEX7, PTS2R)maps-> Peroxisome
InterPro domain[408-486] IPR0129522.5e-36BING4, C-terminal
[166-441] IPR0110467.8e-34WD40 repeat-like-containing domain
[162-432] IPR0159431.9e-29WD40/YVTN repeat-like-containing domain
[319-358] IPR0016801.1e-06WD40 repeat
Orthology groupMCL10553 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202119-TA
ATGGAAAACCAACCTTTACGAAATAAAACGAAAAGGTACTTTGACACCGTGCCCGAAAAGGACGAGAAGGCTCAAAATAAGAATGAAAATGTCAAAATATTTACTGTAAAAAGTACTGCAAAATTACTGAAAATTCATAAGTATAAGGAACAAAATATGAATCGTAGAAAGAAAACTTTTTTTAATAACAATAAAATAAAAAAGAATTTTCCCAGTAAAGCTCCTATTGATCCCAAAAAATTGGAACTCCATTCAAGGGGTGAAGGTTTTACAGGACAAGGGGTATTGCATCCACTTCATTTATTGAAATTAAAGAAAAGAGAAAAGAAATTCAGATATGCGCAAGAGCAGGCTGCTAGAGCGGACATTCTATTGACAGAAGGACAAGGGTATTTAGAGGTAGATGAAGATAATAGAACTACATCCATTAACCAAAAAGTTATAGCTGATAATGTAGACATCACAGCTGCAACTAAAATATTTGAATTAAATCTTGAATTCGGTCCATACAATGCCAAATATTCCCGTAATGGTAGGCATTTACTTTTGGGTGGTAAAAAAGGACATTTAGCAGCTTTTGATTGGGTAACAAAGAAACTGCATTTTGAAATTAATGTTATGGAATCTATTCATGATATGAGTTGGCTACATGTTGAAACAATGGTCGCAGCAGCTCAAAAAGAATGGTTATACATATATGACAATACAGGAACTGAAATACACTGTGTTAAAAAATTAGACAAAATACTTAAAATGGAATTTTTGCCTTATCATTTTCTACTAGCTACAGTTAACGAATATGGATTTATGTCGTGGCTGGACATTTCTATAGGAGAAATAGTTGGTCATTATAATAATAATATGGGGAGGACATCAGTTATGACTCAGAATCCATATAATGCTACTGTATGTTTGGGAAACCCCAAAGGTGTTGTATCAATGTGGTCGCCAAGCTCGAAAAAGCCATTAGCAAAAATATTGTGCCACAAAACACCAATAACTGCTATTGCTGTTGATAATAGGGGCATGTACATGGCTACATCAGGTGTTGACAGGAGTTTGAAAATCTGGGATATAAGAAACTTAGATGGACCGCTACAGCATTATAAATTGCGCAGTGCTCCTGTACATCTTGAGTTTTCCCAAAAAGAAATGTTGGCTGTTGGATTAGGAAATAATGTGGAAGTGTATAGCGATTGTTGTATAAAAACAACAGACCGACCTTACCTCAGGCATAGAATGGCAAAAGAAATATCTAATTTCAAATTTTGTCCCTTTGAAGATGTTTTGGGTATTGGCAATACTGGGGGATTCACCAGTATCATTGTACCAGGCAGTGGTGAACCTAACTTTGATGCTCTGGAAAGTAATCCATTCCAAAATAAGAAACAAAGGAAGGAGGCAGAAGTCAAGGCATTACTTGAAAAGATACCTGCAGAACTTATCACTCTTAATCCATTTGAGGTTATGGAGGTAGATTTGCCATCAATGCAGGACAAAGTGGAGGCTCGCAATAACCTTTTGTACCTGAAACCTAAAAATGTGGACTTTACACCAAAACACAAGAAGAAAGGAAAGACTAACATTGCAAGAAAGAAAATTATAAAAGATGCAGCTCGAAAGAAATTTATCAACCAATCAATAGAAGCAAAGAAGATTCTTAAACAACCAGAAGAAGATAAAATTTCAAAGCCCAAACAATCATTTGGAGTTTTGGACAGATTTATATCAAAACCCAAGGTTAAACAATAA

Protein sequence:

>DPOGS202119-PA
MENQPLRNKTKRYFDTVPEKDEKAQNKNENVKIFTVKSTAKLLKIHKYKEQNMNRRKKTFFNNNKIKKNFPSKAPIDPKKLELHSRGEGFTGQGVLHPLHLLKLKKREKKFRYAQEQAARADILLTEGQGYLEVDEDNRTTSINQKVIADNVDITAATKIFELNLEFGPYNAKYSRNGRHLLLGGKKGHLAAFDWVTKKLHFEINVMESIHDMSWLHVETMVAAAQKEWLYIYDNTGTEIHCVKKLDKILKMEFLPYHFLLATVNEYGFMSWLDISIGEIVGHYNNNMGRTSVMTQNPYNATVCLGNPKGVVSMWSPSSKKPLAKILCHKTPITAIAVDNRGMYMATSGVDRSLKIWDIRNLDGPLQHYKLRSAPVHLEFSQKEMLAVGLGNNVEVYSDCCIKTTDRPYLRHRMAKEISNFKFCPFEDVLGIGNTGGFTSIIVPGSGEPNFDALESNPFQNKKQRKEAEVKALLEKIPAELITLNPFEVMEVDLPSMQDKVEARNNLLYLKPKNVDFTPKHKKKGKTNIARKKIIKDAARKKFINQSIEAKKILKQPEEDKISKPKQSFGVLDRFISKPKVKQ-