Monarch geneset OGS2.0

DPOGS215863
Transcript: DPOGS215863-TA, 1929 bp
Protein: DPOGS215863-PA, 642 aa
Genomic position: DPSCF300029 - 1007161-1013070
RNAseq coverage: 449x (Rank: top 27%)
Annotation
Heliconius: HMEL010106, 0.0, 57.70%
Bombyx: BGIBMGA000410-TA, 6e-140, 59.25%
Drosophila: bora-PA, 1e-16, 34.74%
EBI UniRef50: UniRef50_UPI000178AB0A, 2e-24, 32.94%, UPI000178AB0A related cluster n=1 Tax=unknown RepID=UPI000178AB0A
NCBI RefSeq: NP_001122572.1, 4e-25, 32.94%, aurora borealis [Nasonia vitripennis]
NCBI nr blastp: gi|193083230, 8e-24, 32.94%, protein aurora borealis [Nasonia vitripennis]
NCBI nr blastx: gi|193083230, 2e-27, 31.55%, protein aurora borealis [Nasonia vitripennis]
Group
KEGG pathway: (none listed)
InterPro domain: [54-73] IPR023252, 5.1e-17, Aurora borealis protein
Orthology group: MCL12141, Insect specific

Nucleotide sequence:

>DPOGS215863-TA
ATGAGTAATAAAAATGATAGTCCAAATAACAGAAAAGCTACGCCGCCAACTAAAGGCAAAATCCGTAATCCTTTCGATAAAGTGTTAATTGAGAAATTGCATAAACCCATATGCAGTCCAGGTATGTGTAAAATCTACAAAAAAAAGAACAGAGGTTCTTTTCGATGGGATATAGATCAAGCGTGTGTTTTGGTGCCAACGGAAATTGTTGCTTGCAATAGTCAGTTTGAACCGTCACCCGATATAGCCTTAGAAAAGATTGCTGAGGAGGCTAACGAAAAATTTTTTTCCCAAGAAATGGTCATGCCGAGTCCTATGGAAACATCTAAGAAAGTGATACCTCTTCAAACATCTCTTGAGACCAGCATCCAAATCAACACTTCTATAAAGGAATCTATAGTAACTAGAGATGTTTCAGCTCAAACTGTACTTACATTGCCACCCAAACTGCCACCAGAATTAGAAAATTTATTGAAAACATATTACACATATACTCAGGATCAAAGTCAGAATATATTTGATGAGTATGAGGTGACAGCGAATGGGTCTTTAAGAAGGAAGTTGTTTTTTGAGGAGCGTGATGTAGAGCACAGCGATCATTATGATTCCGAGCAGACAGATGACGAGATCCACATAGATGCTCGTCCTTCATCCAGCTATGAAGCTCACACCCCGGTTACAATTAGCCCAAATTTAAGTTGTAACCAAGCATTAAAAGGTATGAAGAGGACATTCGGCACACCTCTATGTAAGGGTCAGAACAAACCGTCCTACCGCAACAAGATACTGGACGTTGTGGATTTCTGTTTGAGTCCCATAGAATTCCGTACTCCAAAGCGTTGTGTAGCGACGTCCTCACTGGCATCCGTATCACCCATACCAAAGGCAATGTCGAGTGACGACGAGAAGAACAACTTCACTTCCCCCGAATCAGAGGCCATGGCGACTTGTCTTAGCTGTATTGAATCTGAGACGGAGGTCGATAAGAAGTCGACCTGTTTCTGTATAACTCCAACAAAAATAATACTCAAACGGAGCACCTCATTGAAAGACTCGCCTCATAGGAACAGGAAGGGTTCGTTATCGGAGAAACGTAGTCTGAGTATGTCCAGTCTGAACAGGAGTCGCTCCGTGCAAAAGCTAGACTTCAGTATGGACATGAGTATTGATGGTTCCATACATGATAAATCTCAAGAGATTTCACCAAAGACTAAAGCGTCGTGGTCTATAGTAGAAGATTCAAACTTTAATTATGCCAAGGAAACACATAAGGAGTCGCACGAAGTACAGTCTTCCACTAAGTTATTTAGCGTTAATTTGTTAGATGATACACCCATAAAGGGTAAACATAAGACTAGTGTTCTAACACACGAAATTAGCAAAATACGAGGACAAGTAATGCTCAGCCCGTTGCATATGTCTCTAGATAATAGTTTAGATAATTTCGATATTCCGCTCAGTATGGAGGAAAAGAAAATCGATTTCAACACGGTCGATATTAAGCTTCTTACTGAAAATTCACAATTCGACAGTAACAGGACAGAGAATTCTATATTCAAGAGAGTCGACAGCGGCTTCAACGAAAACACGTTCTATGCGAACGCGTCCAGTTACTACGAAAGTGCCATAAAGCCTTCAGAATTGACAGTTACAAAGCAAAACAAGGCGTTAAAGGAAATATCTAATGTTAATTGGATGAGGGTAGACAGCGGTTTCAATGACACCGACAGTATTCAGTTATATGATTCCAAGAAAAATTCACGCTATCAAGGAGCGAGGTTGCCGGAGAAGGAGAATGTTGACGAGTTCGATCGCCTAAACAAAAACGAGCCCATGTCGATGTCGGACTTCATAACGGAAGATATAACATTCAACTGCAATTTTTCCTCTACACCGTCAAAAACAAAGAGCCGTAAAGTCAATTCGTAG

Protein sequence:

>DPOGS215863-PA
MSNKNDSPNNRKATPPTKGKIRNPFDKVLIEKLHKPICSPGMCKIYKKKNRGSFRWDIDQACVLVPTEIVACNSQFEPSPDIALEKIAEEANEKFFSQEMVMPSPMETSKKVIPLQTSLETSIQINTSIKESIVTRDVSAQTVLTLPPKLPPELENLLKTYYTYTQDQSQNIFDEYEVTANGSLRRKLFFEERDVEHSDHYDSEQTDDEIHIDARPSSSYEAHTPVTISPNLSCNQALKGMKRTFGTPLCKGQNKPSYRNKILDVVDFCLSPIEFRTPKRCVATSSLASVSPIPKAMSSDDEKNNFTSPESEAMATCLSCIESETEVDKKSTCFCITPTKIILKRSTSLKDSPHRNRKGSLSEKRSLSMSSLNRSRSVQKLDFSMDMSIDGSIHDKSQEISPKTKASWSIVEDSNFNYAKETHKESHEVQSSTKLFSVNLLDDTPIKGKHKTSVLTHEISKIRGQVMLSPLHMSLDNSLDNFDIPLSMEEKKIDFNTVDIKLLTENSQFDSNRTENSIFKRVDSGFNENTFYANASSYYESAIKPSELTVTKQNKALKEISNVNWMRVDSGFNDTDSIQLYDSKKNSRYQGARLPEKENVDEFDRLNKNEPMSMSDFITEDITFNCNFSSTPSKTKSRKVNS-
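
Consistency check (illustrative):

The transcript and protein lengths in this record agree with each other if the transcript is coding sequence only: 1929 bp is 643 codons, which is 642 residues plus a stop codon. The minimal Python sketch below checks this and translates the first ten and last two codons of DPOGS215863-TA. It assumes the standard genetic code and a CDS-only transcript, neither of which the record states. The helper names `translate` and `expected_protein_length` are hypothetical and are not part of any database tooling. The sketch writes the stop codon as `*`, whereas the record above writes it as `-`.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(transcript_bp: int) -> int:
    """Residues encoded by a CDS-only transcript that ends in one stop codon."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3 - 1


# Fragments copied from the nucleotide sequence above.
first_ten_codons = "ATGAGTAATAAAAATGATAGTCCAAATAAC"
last_two_codons = "TCGTAG"

print(translate(first_ten_codons))    # MSNKNDSPNN
print(translate(last_two_codons))     # S*
print(expected_protein_length(1929))  # 642
```

The translated fragments match the start (MSNKNDSPNN) and end (S followed by the stop marker) of DPOGS215863-PA, and the computed length matches the 642 aa listed for the protein.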