Monarch geneset OGS2.0

DPOGS202147
TranscriptDPOGS202147-TA1386 bp
ProteinDPOGS202147-PA461 aa
Genomic positionDPSCF300162 - 286288-291569
RNAseq coverage322x (Rank: top 35%)
Annotation
HeliconiusHMEL0108920.082.11% 
BombyxBGIBMGA003421-TA0.075.76% 
DrosophilaCG6751-PA4e-10342.55% 
EBI UniRef50UniRef50_F4W8J52e-11648.92%Periodic tryptophan protein 1-like protein n=7 Tax=Formicidae RepID=F4W8J5_ACREC
NCBI RefSeqXP_001663005.16e-11646.57%wd-repeat protein [Aedes aegypti]
NCBI nr blastpgi|3320295979e-11648.92%Periodic tryptophan protein 1-like protein [Acromyrmex echinatior]
NCBI nr blastxgi|3320295974e-12049.02%Periodic tryptophan protein 1-like protein [Acromyrmex echinatior]
Group
Gene OntologyGO:00055153.8e-39protein binding
KEGG pathwayath:AT2G437701e-10 
 K12857 (SNRNP40, PRP8BP)maps-> Spliceosome
InterPro domain[115-440] IPR0159433.8e-39WD40/YVTN repeat-like-containing domain
[129-436] IPR0110464.9e-39WD40 repeat-like-containing domain
[222-260] IPR0197815.8e-09WD40 repeat, subgroup
[220-260] IPR0016808.5e-07WD40 repeat
Orthology groupMCL13404 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202147-TA
ATGGAAGAGGAGGGTACTCCCACTGTAAGTTTAGTATCATGTATGCATTTTGTGAGACGGGGAATAGCGAAACCAGTGCCAGAAAAGATTGAATTGACAGAAAATGAATTAGAAAAAATTATAAAGCAGACTGCTGAAGATCTTCGTTTAACAGAAGCAGGAGATGATCAAAGTGGAGAAGAAGATGAAGCTGCACAGAGTATCAGGGAGCCTCCAGCAAACCCCAATGATGAGTTTGACTTTGAACATTATGACCAAGAAGATTCGAGTAACCCTGTAGGTATAGGGACTATAGCAACTCTACCTAACTTAGGTGATCTCAGTGAAAACATACAAATCAGAACAGAAGGTCCAGATAGTGATGAAGAAGATGACATCATTAAGCCAGATGACAACTTATTACTTGTAGGACATGTTGAAACAGATGCCAGTGTCTTAGAAGTTTATATTTTCAACAAAGAAGAGGGATCATTCTACGTCCATCATGACATAATACTGCCCTGGTTTCCGCTGTGTATAGAGTGGCTCAATCATGACCCCTCAGATCCACAACCAGGCAATCTTTGCGCTCTCGGTGGCATGGACCCAGTGATACAAGTGTGGGATTTGGATATTGAAAACTGTTTGGAACCGGCTTTCAAGCTCGGCAGGAAACCAAATAAGAAGAAAAAGACAAAAAGAATTGGTCACAAGGATGCTGTTCTGGATCTGTCTTGGAACACGAACTTTTCTCACGTCTTAGCGAGTGGCTCGGCGGACAACACTGTACTACTGTGGGATCTCGATCAAGGCTTACCACACACTAAACTAACCTACTTCGAAGACAAAGTCCAATCACTATCGTTCCACCCCCTGGAAGCCCAGACCCTCCTGTCTGGTTGTTGTGACGGCCGAGCGCGTGTGTCGGACTGTCGGGACGAGGCCGCCTTCCGCACGTGGGTGCTCCCCACTGAGATAGAGCGAGTGCACTGGGATAGGAACCAACCGTTCTGTTTCGCGATGAGCAACAATATCGGTAAAGTGGCGTACGTGGACGTCAGACAGGAAGAACCGTTGTGGACCATCGACGCTCATCAGAAGGAAGTCACAGGACTCATTTTAAGTGAAAAGGTTCCAGGGCTGATGATAACTGTCGGCTCGGATGAAAAACTCAAATGCTGGGATATCACGGGCCCTACTCCGCTACAAATAAACGAGCGCACCAACAGGGTCGGACAGGCCTTATGCGCCGCTCAGTGCCCGGAGGCGCCGTTCGCCGTAGCGGTGGGCGGAGACAACAAAGAGTGCTACATCGAAATGGTAGACCTCAGCAACAACGATGAAGTTATGAACCGTTTCGGCCAGCGCGTCACGACCGAATCCAACGCTGAAGCTATGGACGCGTAA

Protein sequence:

>DPOGS202147-PA
MEEEGTPTVSLVSCMHFVRRGIAKPVPEKIELTENELEKIIKQTAEDLRLTEAGDDQSGEEDEAAQSIREPPANPNDEFDFEHYDQEDSSNPVGIGTIATLPNLGDLSENIQIRTEGPDSDEEDDIIKPDDNLLLVGHVETDASVLEVYIFNKEEGSFYVHHDIILPWFPLCIEWLNHDPSDPQPGNLCALGGMDPVIQVWDLDIENCLEPAFKLGRKPNKKKKTKRIGHKDAVLDLSWNTNFSHVLASGSADNTVLLWDLDQGLPHTKLTYFEDKVQSLSFHPLEAQTLLSGCCDGRARVSDCRDEAAFRTWVLPTEIERVHWDRNQPFCFAMSNNIGKVAYVDVRQEEPLWTIDAHQKEVTGLILSEKVPGLMITVGSDEKLKCWDITGPTPLQINERTNRVGQALCAAQCPEAPFAVAVGGDNKECYIEMVDLSNNDEVMNRFGQRVTTESNAEAMDA-