Monarch geneset OGS2.0

DPOGS213612
Transcript: DPOGS213612-TA, 1362 bp
Protein: DPOGS213612-PA, 453 aa
Genomic position: DPSCF300033 + 847264-849369
RNAseq coverage: 97x (Rank: top 62%)
Annotation
Heliconius: HMEL013687, 1e-112, 78.14%
Bombyx: BGIBMGA011801-TA, 0.0, 83.44%
Drosophila: CG32085-PA, 2e-174, 76.62%
EBI UniRef50: UniRef50_UPI0001791727, 6e-157, 58.72%, UPI0001791727 related cluster n=1 Tax=unknown RepID=UPI0001791727
NCBI RefSeq: XP_971494.1, 0.0, 72.21%, PREDICTED: similar to AGAP012123-PA [Tribolium castaneum]
NCBI nr blastp: gi|307202150, 0.0, 67.58%, F-box/LRR-repeat protein 16 [Harpegnathos saltator]
NCBI nr blastx: gi|91092916, 0.0, 72.21%, PREDICTED: similar to AGAP012123-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL14881, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213612-TA
ATGTCGTCGGTGTCAGCGCAAGGAGTGGTCGAGCGCGCTTCCGCGGAGCTATCCCGCCGCATAACAGGACTGGGACTTCGAGGTCGTAAACGTTCGCCGGGAGTCGTCGAGAGAGTAGCCAATGCCTTGTGCGGGCCTGCCACCACCGCCCCACCGAGGATACCCCGCCGGAGACGACCCTCGCCGAGACCGCTTGTGTGGAGCCAATTTATTATAGAGGAGAGATTCCTGGAGAAGTTCTTTCTGTATTTCAATGGCAGCGAAAGAAGAACCCTTGCTCAAGTCTGTACTAAATGGCGTGATGTCCTGTATTCTTCTCCCCGTTGGTGGAATGGCCTAGTCGCTGTTCTGGATTGTAGGGAACTACGATCTGAAACTGGTTGTTGCATGCAAAGATTTTACAATTCGGTTGTGAGGAGGGGTATCCGAGGATTTGTACTAATATCTGCAACAGATGATGATATAAATGAATTAATAAAACAGTTTCCGCTCTCAGCTCATCACATACATGCTATTGGTTTAAAAGGATGCACAATAACCGATCGAGGTTTAGAATCAATACTCGATCATTTACAGGTTCTCTTTGAACTAGAACTAACAGGCTGTAATGAAATAACTGAAGCTGGTCTCTGGGCTTGCTTAACTCCTAGGATTGTATCACTTACACTTACCGACTGCATTAATATTGCTGATGAGGCAGTGGGTGCGGTTGCTCAGCTGTTGCCGTCGCTATATGAGTTCTCATTGCAAGCATACCATGTGACCGATGCAGCACTCGGTTATTTCTCACCAAAACAAAGTGCCTCACTCAGCATCTTAAGATTACATAGTTGCTGGGAGCTTACTAACCATGGCGTCGTTAACATTGTGCATTCTCTGCCGAACCTGACAGTGCTGTCCCTCAGTGGATGCAGCAAGGTCACTGATGAGGGTGTGGAACTCCTGGCTGAGAATCTGCCGCGTCTACGAAGCCTCGATCTCAGCTGGTGTCCGCGGGTCACTGACAACGCGCTCGAATACATCGCCTGCGACCTGAACCAGCTTGAAGAACTCACGTTGGACCGATGTGTGCATATAACGGATATCGGCGTGGGCTACATTAGCACAATGCAGTCGTTGGCCGCGCTGTTCCTGCGCTGGTGTTCTCAAGTGCGGGACTTTGGTGTGCAGCATCTGTGTGGCATGCGAAGTCTGCAGCTACTGTCGCTCGCCGGTTGTCCGCTTCTCACATCCGGTGGCCTCTCAAGCTTGATCCAATTGAGGCAGCTACGAGAACTCGAACTGACAAATTGTCCGGGAGCATCCCCTGAACTGTTTGACTACCTTCATGAGCATCTACCGCGTTGCCTCATCATAGAATAA

Protein sequence:

>DPOGS213612-PA
MSSVSAQGVVERASAELSRRITGLGLRGRKRSPGVVERVANALCGPATTAPPRIPRRRRPSPRPLVWSQFIIEERFLEKFFLYFNGSERRTLAQVCTKWRDVLYSSPRWWNGLVAVLDCRELRSETGCCMQRFYNSVVRRGIRGFVLISATDDDINELIKQFPLSAHHIHAIGLKGCTITDRGLESILDHLQVLFELELTGCNEITEAGLWACLTPRIVSLTLTDCINIADEAVGAVAQLLPSLYEFSLQAYHVTDAALGYFSPKQSASLSILRLHSCWELTNHGVVNIVHSLPNLTVLSLSGCSKVTDEGVELLAENLPRLRSLDLSWCPRVTDNALEYIACDLNQLEELTLDRCVHITDIGVGYISTMQSLAALFLRWCSQVRDFGVQHLCGMRSLQLLSLAGCPLLTSGGLSSLIQLRQLRELELTNCPGASPELFDYLHEHLPRCLIIE-
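
The transcript and protein lengths listed in this record can be cross-checked against each other: 453 residues plus one stop codon gives 454 codons, or 1362 bases. The sketch below illustrates that check, together with a minimal translation routine. It assumes the standard genetic code and writes the stop codon as "-", as the protein sequence in this record does. The function names `translate` and `cds_length_matches` are hypothetical and are not part of any database tooling.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, stop: str = "-") -> str:
    """Translate a coding sequence codon by codon (standard code assumed).

    Stop codons are written as `stop`; trailing bases that do not fill
    a complete codon are ignored.
    """
    cds = cds.upper()
    residues = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


def cds_length_matches(transcript_bp: int, protein_aa: int,
                       has_stop: bool = True) -> bool:
    """Check that a CDS of transcript_bp bases can encode protein_aa residues."""
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# First six codons of DPOGS213612-TA and the lengths listed in the record.
print(translate("ATGTCGTCGGTGTCAGCG"))
print(cds_length_matches(1362, 453))
```

Passing the full nucleotide sequence to `translate` should reproduce the protein sequence, including the trailing "-", if the record is internally consistent.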