Monarch geneset OGS2.0

DPOGS203277
TranscriptDPOGS203277-TA1968 bp
ProteinDPOGS203277-PA655 aa
Genomic positionDPSCF300003 - 1787707-1797108
RNAseq coverage430x (Rank: top 28%)
Annotation
HeliconiusHMEL0036312e-12358.49% 
BombyxBGIBMGA011005-TA0.056.43% 
Drosophilawah-PD8e-2733.61% 
EBI UniRef50UniRef50_D7EIT16e-4029.89%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D7EIT1_TRICA
NCBI RefSeqXP_966763.11e-4029.89%PREDICTED: similar to CG4699 CG4699-PA [Tribolium castaneum]
NCBI nr blastpgi|910937732e-3929.89%PREDICTED: similar to CG4699 CG4699-PA [Tribolium castaneum]
NCBI nr blastxgi|910937731e-3728.03%PREDICTED: similar to CG4699 CG4699-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL15958 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203277-TA
ATGGCCCCGGCGCTAACTGTGACACCCCAATATAACCACTACGACGCCGTCGAAGAAACTGAGAAGCTATCCCACCGAAGACGGGCCCTCGCGTCCGTCGATAACGTTTCTATACCCATGGAGGAAGACGATATGGGGCTTCAGAACCAACCCAGCCTGGCTAAGGACTCTCAGGAAATGGAACAGATCCTAGTAGATTTAGGTAACAATGACCTTGTTCAGGCTGATTTACTACAAGCGATCAAAACCTTGGAAAGTGGTGGCGACTCCCTGTCTACAGGCGACCCAGATGGTATGTTTCCCTTAAGTGGCTTCGATCTAGCTGACACAGTCGATTCAGAGAGCGATGCTACCGAGGATAAGATAAGATTAATACAAGCGCGTTTAGAACGCAGGTGTGCGTTTCTATTGAGACGGTTGAGAATATTGCAAGCGAGAGCAATCGGCAAAAGAATATCAGAGGAAGCATCACAGACTTTCGAAAAATGTGCAAAGGGCGCACGGAGAGATGGCGGTGGAAGGCCAATGGGGCTGAAGGCTCTACTAAAAAGGATAGAGACAACAGCCGCTTTACAAGCGAGTGCTGCATCACGTTCTGTGGTCGGTCCTAAGTATTACCGTGCGGGGACTTCGAAAGGTGATGCCTCAAGATCTGCCTCTATTGGAATACCTTCAGGGACTCTGACTGGTTTGGAAGATTCGGCTGGTGCGCTGAGATCACACTTATCTGTAGTAAAACATCAATTGGATTCTGATGCAACAGCTTCGTCTTCTGGAGCCGAGAGCAACGATGAGGCAGTCACCTACAACAACACTCACCAACAACCAATGCCCATTATCAAATTTCTGTTGATTAAGAATAAAGTCCGTGAAGCTAAAGGTCCGGTAGAATTCGAAGTTAGTGGCGAATCCCAGTTCGAAGACACGTGTTCCAGGGTCAGACCTCTCAGGAGGGACACGTTTAATAAGAGGAAGTTGCTGCAGATGCACAACTTACACATAGCAACCAATAAGGCCGCGAAACCATCAGATATCAATTGTCGTTGTGTGGGTAGTTCGTGTGCGGTGTGTACGGGACGGTTCGAGGCCACACAACCGGCAGCTCCGAGCGGAATGCTACCACCAGCAGCGCGTCGGGCATTAGTCGATCCTTCGCATCACCCTGTGCTCAGCGACGTTAATGACCTTCGTCCGTCTGTTCATCTATCGGCGTTAACGTCACGCTCGTGGTTCAGATCGCGAGTTACAAAGTCGTGGCGCGGAGCACACACCGGTACAGCACCGAGAGCGCCGCCAGCACCGCCCCCCAGACACAGGCGGCTAACAACTAGTATGGGTCGAGGTCGGTCGAGTACTGAGAACCGCTTGTGGCGGCGGCAGTCTTACGACATTGACAACATCGTTATACCCCAGAGCGTCGCCGCCAGCACTCGTCCGGAGATACTCACCTATAAAGAAATTATTACACCAAAATGGAGAGTCATGGAAATACCAGAAACACCGCTCAACAACGGTGTGTCCAAGTCTAATAGGATGTCCATAGAGAGTGATGACGAGGATATATCAGAGGCAGCGGTGCAGGCTCGTCACACACGGGCCGAGACACGAGAACGTAACAGATATCTCCGGAAGAGAAGATCTAGGAGACGTAACACTGAAGAAGAGAATAATGATCCCATACCGGAAGTAGTGGTTCGACAGCCTACACCGCCTCTACAGGAGACAGTGCCACCTTACTCACCGAGGCAGTTCCCTCTCAAAGACGATCTCTACCAGGATATGTTATCCAAAATGCCAGAAGGTTATCGACCCATCAGCCCCGATTTAGATCCGGATATAACCATGGAAGAAGACACGAGTTCCCTATCACCGTTGTCACCTTTGAACTTTGATGGTGATGATCCCGATGATGCCGAATGGAATCCCAGCAATGAGAAGTCAGATAAAAGAAGAAGTACGCTAAGATAA

Protein sequence:

>DPOGS203277-PA
MAPALTVTPQYNHYDAVEETEKLSHRRRALASVDNVSIPMEEDDMGLQNQPSLAKDSQEMEQILVDLGNNDLVQADLLQAIKTLESGGDSLSTGDPDGMFPLSGFDLADTVDSESDATEDKIRLIQARLERRCAFLLRRLRILQARAIGKRISEEASQTFEKCAKGARRDGGGRPMGLKALLKRIETTAALQASAASRSVVGPKYYRAGTSKGDASRSASIGIPSGTLTGLEDSAGALRSHLSVVKHQLDSDATASSSGAESNDEAVTYNNTHQQPMPIIKFLLIKNKVREAKGPVEFEVSGESQFEDTCSRVRPLRRDTFNKRKLLQMHNLHIATNKAAKPSDINCRCVGSSCAVCTGRFEATQPAAPSGMLPPAARRALVDPSHHPVLSDVNDLRPSVHLSALTSRSWFRSRVTKSWRGAHTGTAPRAPPAPPPRHRRLTTSMGRGRSSTENRLWRRQSYDIDNIVIPQSVAASTRPEILTYKEIITPKWRVMEIPETPLNNGVSKSNRMSIESDDEDISEAAVQARHTRAETRERNRYLRKRRSRRRNTEEENNDPIPEVVVRQPTPPLQETVPPYSPRQFPLKDDLYQDMLSKMPEGYRPISPDLDPDITMEEDTSSLSPLSPLNFDGDDPDDAEWNPSNEKSDKRRSTLR-