Monarch geneset OGS2.0

DPOGS203675
Transcript: DPOGS203675-TA, 3258 bp
Protein: DPOGS203675-PA, 1085 aa
Genomic position: DPSCF300010 - 2257241-2267595
RNAseq coverage: 388x (Rank: top 31%)
Annotation
Heliconius: HMEL013333, 0.0, 75.21%
Bombyx: BGIBMGA003470-TA, 0.0, 57.13%
Drosophila: CG8920-PB, 2e-48, 24.58%
EBI UniRef50: UniRef50_UPI0002063C76, 9e-121, 30.00%, UPI0002063C76 related cluster n=3 Tax=unknown RepID=UPI0002063C76
NCBI RefSeq: XP_001121997.1, 3e-122, 30.67%, PREDICTED: similar to CG8920-PB, isoform B [Apis mellifera]
NCBI nr blastp: gi|380019814, 2e-120, 30.10%, PREDICTED: tudor domain-containing protein 7-like [Apis florea]
NCBI nr blastx: gi|380019814, 3e-117, 30.04%, PREDICTED: tudor domain-containing protein 7-like [Apis florea]
Group
KEGG pathway 
InterPro domain: [888-1006], IPR008191, 9.7e-19, Maternal tudor protein
Orthology group: MCL12433, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203675-TA
ATGGCTGATAAAGAACAGGTAATCCAGGCACTTAGAGCAACACTTCTTTCAGTGAAAGGAGCTATAACAATAAAACAATGCAACAGAGATTATAGGGAATTGCAAGGGGAATGGATTCCTTTCAAGAAGTTAGGCTATCCAACCCTCGAAAAATTATTCCAAGATGTTCCCGGGTTTAAAATAACTCAAGTGAATGGAGATTGGGTTGTTGATGCTATTGCAAGTCAGGAAACACAACACATTGCTTCTATGGTAGCACGTCAGAAGACAAACAAAAAGCAGATAATCAAATTAAATCATACTAACCGCTTTCCAAAGAAGCAAGGATCTTGGAGAAAGCCAACTCACACATCTAATTACACATCGAGTTACAACAATCAATACTCAAACCAGTATTACAGAGTAAAGACAACACCATTCAATAACAACAGTTACAATGGAGGCAAGTATTACAAGAATAACAAATCAATTGACTCAAAGAATAGTTCAATTAGTTCCAAAACATCAACAAAAGGAAGTGAAAATCAGGATAAGCCTGTGCTAAAGAGTGTTAATTCTTCATCGCAGATCATTCAAAAGTCAAATATAATATCAGATGATTATAAACATTCCATAAAAAATGAGAAAGAGACTGATTCAAAACCCAATCGAAAGGTATCTGCTTCCCAAAGGTTATCAAAGCTATGTGACAGATTGAGTACAGTTGACTTGATACCATTGCAGCTTCCAGCTCACAATAATTTATTGGATAGTACTGATCCAGTGAGTGTTGCATGTCCTCCGCCGAAATTAGAGCCACCTCCAGATTCAACAGCTGTGGAGAAATTGGAATGGACTTGTCGAAAGCTCGGTCTGCCCCCGCCGGTCTATAAAGTTAATGATATACGACCAAAACGAGGACCCGTTAGTTATGCCTGTAATATCAAGGTGGGGACATACCATTCCATGTCATATCCGGTGCTGTCGCCGTCAGAGGAGGCGGCCCGCGAGTTGGCTGCTGTGAATCTTCTGGCGCAGGTTGAGAGTCTCGAAGCGGGTGGACTGGTGACGGCGAGTGCTGCGTCTGCTGTGGCCGGACTGACCGCGCTGGTCGCTGAACATTCCGCCGGGTTACTGGCCAAACATGTTCCTCACCAGTACAGGGAAAAACATGGAGAAAATTTGCCAAACAATTGGTTGGAGCTCATAGAGAACTGCCCATATATTTTAAAGGATAGGGTGGCAAATGACCTTCTAGTTCTCCTTCCTAATATAGAGGAACCTACTTCTCCGCGAGTAGAAAATTTCCACCAGGATAATACTCTCGACACCGACCTACCGCCATTGCAGTTCCCAGAGGATGATTTCTGGAATGTTTTCATTACTGTAGCAAACTCTACTTTAGAGGTCTGGCTCAGAATTATCGGACCACAATACAGCGACGCTTTCGAAGTACTTCTAGTAGATATGGCAACTTATTACGAACATTCTGGAACTACTGTGGATAAATCCATGATAATATTAAATGCTTGGTACGCAGTTAATCTGAACGACGGTGGATGGCAACGAGCGAAAATATTAGAAATCGAAGACGAAACAGCGACCGTGTTCCTGGGAGACCACGGAGACGACGACACAGTTTATTTTAAGAATATCAAGATCTTAGAGCCTCAGTTTAGGAAATTACCGGCGCAGGCGATCCTTTGTCGCTTGGAAGGTGCTGAGGAGTTGGCGGCGAGTGAGGCGGGCGCGACGCTGGTGAGGCGTCGTCTGCCCGGGGAGGTGTTGGTGGCGGCGCCCGGCCCCCGCCGCGACCCCACCGACCCCTCGGTGGCCGTCGTCCTCTACGACACCTCCACACCCAGGGATCTCAACCTTAATAAAGAGATCGTACACGATTTCGCCATTTCCGGCGCGTTTACACTTACTCAGAAGCTGTGTGAGGTTGAGGTGGGTTGTGTGACGGAGGAGGGTCGCGTGTGGGTCTCTCGTGCGGGCGGGGCGGAGGCTGTCAGAGCAGC
TCTCGCGCTGCTCACCTCCGGACCTTACCGCCGCCCTCTGCCGGCTGCACCACACGCTCCCCCCACGACACCTAACGCTCTATATATCGTGCGCACTCTAGCTGGTGACTGGGTGCGTTGCATCATTATCAGTGGTCTAGACGGTGAGGGCACGGTTCGCACTCAGCTGGTCGACAGCGGTCTGATCCTCCGTGCTCCACTCTCTTCCCTTGTACCGCTTCAGGCCTTCTCACCCGCCCTCAATGAGTACCCCTACCAGGCGAAGCAGGTCCGGCTGGGTCCGGCGGAGCGTGCGGCCGGCAGTATGGTGTCCCGCTTGCGAGACATGCTGCTGGGAGCGAGCGTGTTGTGCCGCGCGCTGCCCGCCTCCGCCCCCTCCCCCGGCTCCCCGCCCAACGTGGAGCTGTACGCGCGCTGGGGACCCCAACACATGCTGGCCTCGGTCAACGACGGCATACTCATGGAGTACGAGCTGATACAACTAGGAAAACTAGAAGAGAAAAAGGAGGGTGATACAACGAACCATTTGGAAGCTTTGCACAGGAAGAAAGAGCGTATCTTCGGTACGGGTTCTACGGGGGATGGAGGGGGGGCGGCGGGCGGGGCGGAGGCTGTACGGAACTCCTTGCCATCGCCGCAGCTGCCCGCTAAAGGGACCTGCATCGATGTGTACGTCGCTATGGCCGCCAACCCGTGGAACTTCGTGGTTCAGCCAAACGTCACCAGAAAAATGCTACAAACAATGATGTCTTCATTGCAAGTGGAATGCCCAAAGCTGTCGGAAAGTGATGCTCCTACGAGTCCCGTCAGTGGTGAACTGTACACGGCCTTCTACGATAAAGACGACACGTGGTATAGGGTAACTATAGCCGGTTCTGTGTCCTCTGAGATGGTGTCTGTGTATTTCTGTGATTTCGGTGATCTGGCTTTGTTTGCGAACGAGTCGCTTCGTCCTGTGCCGGCCTCGGTGCCTCTAGCGCGATCCTTGCCACCGCAGGCTATCAAGGCTCGGCTTTACGATGTAAAGCCTTTGCATCAGGACTGGACAGTGGAAGACTGTATCAGATTCCAGGAACTATGTGTGGAGCAACAGTTTGTTGGTGTGTGTAAGGATGTTGGCAAGGACCCTTTGAATCCGACCGAACCGCTGGTCACCCTCGACCTGATAGATACCTCTACTGATGAAGACATTTATTTGAACAAGCAGTTGGTTGCTGAGGGAAGGGCTCGTTTAGCTTCCGTTTCCTCTACGAAATAA

Protein sequence:

>DPOGS203675-PA
MADKEQVIQALRATLLSVKGAITIKQCNRDYRELQGEWIPFKKLGYPTLEKLFQDVPGFKITQVNGDWVVDAIASQETQHIASMVARQKTNKKQIIKLNHTNRFPKKQGSWRKPTHTSNYTSSYNNQYSNQYYRVKTTPFNNNSYNGGKYYKNNKSIDSKNSSISSKTSTKGSENQDKPVLKSVNSSSQIIQKSNIISDDYKHSIKNEKETDSKPNRKVSASQRLSKLCDRLSTVDLIPLQLPAHNNLLDSTDPVSVACPPPKLEPPPDSTAVEKLEWTCRKLGLPPPVYKVNDIRPKRGPVSYACNIKVGTYHSMSYPVLSPSEEAARELAAVNLLAQVESLEAGGLVTASAASAVAGLTALVAEHSAGLLAKHVPHQYREKHGENLPNNWLELIENCPYILKDRVANDLLVLLPNIEEPTSPRVENFHQDNTLDTDLPPLQFPEDDFWNVFITVANSTLEVWLRIIGPQYSDAFEVLLVDMATYYEHSGTTVDKSMIILNAWYAVNLNDGGWQRAKILEIEDETATVFLGDHGDDDTVYFKNIKILEPQFRKLPAQAILCRLEGAEELAASEAGATLVRRRLPGEVLVAAPGPRRDPTDPSVAVVLYDTSTPRDLNLNKEIVHDFAISGAFTLTQKLCEVEVGCVTEEGRVWVSRAGGAEAVRAALALLTSGPYRRPLPAAPHAPPTTPNALYIVRTLAGDWVRCIIISGLDGEGTVRTQLVDSGLILRAPLSSLVPLQAFSPALNEYPYQAKQVRLGPAERAAGSMVSRLRDMLLGASVLCRALPASAPSPGSPPNVELYARWGPQHMLASVNDGILMEYELIQLGKLEEKKEGDTTNHLEALHRKKERIFGTGSTGDGGGAAGGAEAVRNSLPSPQLPAKGTCIDVYVAMAANPWNFVVQPNVTRKMLQTMMSSLQVECPKLSESDAPTSPVSGELYTAFYDKDDTWYRVTIAGSVSSEMVSVYFCDFGDLALFANESLRPVPASVPLARSLPPQAIKARLYDVKPLHQDWTVEDCIRFQELCVEQQFVGVCKDVGKDPLNPTEPLVTLDLIDTSTDEDIYLNKQLVAEGRARLASVSSTK-
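
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The function name `translate` and the short sequence fragments are illustrative only; the fragments are copied from the start and end of DPOGS203675-TA.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the last character of the protein record).
    """
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length check: 1085 residues plus one stop codon should account
# for the full 3258 bp transcript.
transcript_bp = 3258
protein_aa = 1085
lengths_agree = (protein_aa + 1) * 3 == transcript_bp

# First eight and last four codons of DPOGS203675-TA.
start_fragment = "ATGGCTGATAAAGAACAGGTAATC"
end_fragment = "TCTACGAAATAA"
start_peptide = translate(start_fragment)
end_peptide = translate(end_fragment)
```

To check the whole record, join the nucleotide lines into a single string and pass it to `translate`; the result can then be compared with the DPOGS203675-PA sequence.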