Monarch geneset OGS2.0

DPOGS201072
TranscriptDPOGS201072-TA1389 bp
ProteinDPOGS201072-PA462 aa
Genomic positionDPSCF300185 - 226736-229996
RNAseq coverage227x (Rank: top 44%)
Annotation
HeliconiusHMEL0058370.073.88% 
BombyxBGIBMGA007164-TA2e-16761.91% 
DrosophilaCG43088-PA3e-1823.83% 
EBI UniRef50UniRef50_UPI00020613052e-3930.36%UPI0002061305 related cluster n=5 Tax=unknown RepID=UPI0002061305
NCBI RefSeqXP_001943491.11e-4635.29%PREDICTED: hypothetical protein [Acyrthosiphon pisum]
NCBI nr blastpgi|1936107372e-4535.29%PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
NCBI nr blastxgi|1936107371e-4335.20%PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Group
KEGG pathway 
Orthology groupMCL10102 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201072-TA
ATGGCTGATAAGCCGCCAGAAGACAGCTACATATCTGTTGTTGACGAAGAACCTGTGTTTTTTGAGTTACTCAAATGGGATTCGTCACAATCGCAAAAGCAACAAGAGCCGAGGAGTTTAGAAAAAGCAAAAAGTCCGGAGAAAATGGTGACAAAATCTAAAGAAGAGTCAGATCCGTTTGATTTGAGTGATGCCGCCTTTTTAGATATGTATCGACTTTCAAAAGATCTGGCGCGAAATCTTTGTGAGGAATTGAAACCTGTTATGCCCGATTCTATTAAATCGATTGAGTTTTCAGTCGAAAGTAAAGTTTTAGCAGCTTTATCATTCTATGCTACTGGCAAGTATCAGAAATCAATAGGGGGTAGATCGGACCCCAGTATAACTCAGTATTTTGTGGCAACAGCGGTGATGCAGGTCACTGAAGCTATGAATGACCCCAGTATTATTAAGAAATATATACACTTCCCACATTTGAGAAATGAGAGGGAAGTCATCAAAAATGGTTTTTACATGAAGTATGGCATCCCTAATGTTGTTGGCTGTGTGGACTGTGTGCATGTGCCCATCGCCCGGCCCGATGAAGATCAGAAGAAGCACTTCAACAAATCATACCACTCTAAGAAAGTACAAATAATAAGCGACAGTCGCCAGCGCATCATGAGCGTGTGTTCTGAGGGTGGAGGCTCATACTCCCACGACGCTCTGCTGGCCAGACACGCCGTCACCGTGGACCTGGTCAGTCTGAACAACTCACGGGATCTCTGCTGGCTGCTAGGCGGGCCGCATTACTCACAGAAACCGTACCTGATGGCCCCAGTGCCGAAAATGACGAAGAAGTCTTCCATGTCACCGGAAAAGTATTACACGAACCTGCACGCGCAGGCGCACTCGGCCGTCACGGAGACTATCAAACAGTTGAAGGCGCGCTGGAAGTGTCTGCAGGCCACCAGCAACAAGCAGTTCGACCCGCCCACCGTCGCCAAGATGGTCCTCGCCTGCTGCGTGCTACACAACATATGCACGGAGCACGGCATTCCGCCCGTGGACATGACGCAGGCCGAGGAGCGTCTGGAGGCCATGAAGCAGAGGGTGGCCAACGCCCCGGCCTCCAGGAGACAGGAACACGACCAGCTCGGCCTGCAAGCGCGGGCTGCGCTCATACAGAGGCTGTGGGCCGAGAGGAGCATCACGACCGACGCCTGCCCCGCCACCAAGAGGAGGCTGGCGAAGAAGGACCGGCCGCCGGAGACCCACCCTGTACATCACCCAGAGGTGCATCAGCATCAGATGCACGACGACCCCAAGAGACCCAGAATACTCATGAACAACCCCTACAGCATCGGAGTGGGCATGCCGCCGGCCTGGGGTCACTACCCGCAACACTGA

Protein sequence:

>DPOGS201072-PA
MADKPPEDSYISVVDEEPVFFELLKWDSSQSQKQQEPRSLEKAKSPEKMVTKSKEESDPFDLSDAAFLDMYRLSKDLARNLCEELKPVMPDSIKSIEFSVESKVLAALSFYATGKYQKSIGGRSDPSITQYFVATAVMQVTEAMNDPSIIKKYIHFPHLRNEREVIKNGFYMKYGIPNVVGCVDCVHVPIARPDEDQKKHFNKSYHSKKVQIISDSRQRIMSVCSEGGGSYSHDALLARHAVTVDLVSLNNSRDLCWLLGGPHYSQKPYLMAPVPKMTKKSSMSPEKYYTNLHAQAHSAVTETIKQLKARWKCLQATSNKQFDPPTVAKMVLACCVLHNICTEHGIPPVDMTQAEERLEAMKQRVANAPASRRQEHDQLGLQARAALIQRLWAERSITTDACPATKRRLAKKDRPPETHPVHHPEVHQHQMHDDPKRPRILMNNPYSIGVGMPPAWGHYPQH-