Monarch geneset OGS2.0

DPOGS206079
Transcript: DPOGS206079-TA (1431 bp)
Protein: DPOGS206079-PA (476 aa)
Genomic position: DPSCF300028 - 180348-183263
RNAseq coverage: 358x (Rank: top 33%)
Annotation
Heliconius: HMEL005770, E-value 0.0, 79.18%
Bombyx: BGIBMGA006861-TA, E-value 2e-90, 69.37%
Drosophila: CG6406-PB, E-value 3e-91, 47.57%
EBI UniRef50: UniRef50_Q7PT63, E-value 1e-104, 47.82%, AGAP007375-PA n=3 Tax=Culicidae RepID=Q7PT63_ANOGA
NCBI RefSeq: XP_624097.2, E-value 2e-112, 47.42%, PREDICTED: similar to CG6406-PB, isoform B [Apis mellifera]
NCBI nr blastp: gi|350418089, E-value 1e-113, 47.35%, PREDICTED: hyccin-like [Bombus impatiens]
NCBI nr blastx: gi|350418089, E-value 4e-108, 47.35%, PREDICTED: hyccin-like [Bombus impatiens]
Group
KEGG pathway 
InterPro domain: [20-344] IPR018619, E-value 7.4e-100, Hyccin
Orthology group: MCL13072 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS206079-TA
ATGGCGGAATGGAAGCATTTAATAACAGAATGGTTCAATGAATATGCTGTTCTAAGAGAAAATGAGATCAAAAGCTTTGCTGCCGAACATGAGCATAACCATGAAATTGCAACAGCAATTTTCAATTTACTCTACAGTGAAGACGAAAGTGAGAACCAAGTGAAAGGTCAAAGAAATGAAGTAGATCAACAGATGCTTGAAAATGTTTGCATACAACTATTCAGTTTCTACAGATCTAAAGAGGTTGAATTGCAAAGATTTACATTACAGTTTGTTCCCACTTTAATTTACACATACCTCAGTAATGTTGCCCAAGGCAATAGTAAAGCGTGCCGGTGTATAGAAACATTACTCATTGGTCTTTATAACTTTGAAGTGGTTGATGAAAATGGTAAACCAAAAGTTGTGTCATTTAGGTTGCCTTCATTGGCGCAGGCCTCTATATATCATGAGCCATTATCCCTTGGATCTCAGTTCCTAACAGAAAGTGCTCTCCGACGATGGGAAGAGTGTAATACTAAACTTGTAAGATGGGGACCACATGCTCAAGTGGAGACAATAAATGCACAAAACCGCCTTAAAGTCATGGCTGCTCTATTATTTATTTATAATGGTCAACTGAGCTTACTGCCAAAACTCTCTTTACGCCATTTCTGTATTGCTGCATCTCGAATAGTAACACAAGGCTTTAATAAGAAACCTGGCACCAAATCTATTCAAAGGATACCCGTGTCCTCAAATTTCATGTTGGAGATGATTGAGGGAGCATATTATGCTATGTTCAATGAATTTTACACATTTGCTCTTCAAGCTGTTAAGGACATAGATCAGAGGGCCCAGTATGAACTGCTGCCAGATGTGATGTTAGTCACTAGCGCTGTTATTAATTCATTAATGAATAACCCAACGGGCCAACCCTGCGATGGACCTATGGGTATAAGTGTGGCGCTATCCCCAGCTACAACTACAGTTACTATGTCTAAATCTATGATAACAAATGCATCATTCCGTACGAAAAAATTACCAGATGACATTCCAATCCAGGCCGGTCAAGCCGTTTCGACGGATTCAGCCGAAATGCTGACATCCATAACAGAAGAAGGCGAAGATACTCCCATGCAACGCGGCGCTGTGAGGAGCTCCAAACCACGACACGGCGGGCTGCTCAGCAAGAAGAAAGATATCAAAGACAAGACGAATGATAAGAAAACACCAACACAGAAGGGGATCTGGAACAGCCTGAGTGGTGGTGGAGACATGGTCGACGCGAACGCTGACGGCTTCGAAGCCAACGGAAGCGTTAAGGACGAAATTTCGATGACGTCAGTCCAGAACAGTTCAGAAAATTCGGACTCGCGATCCCAGGTGACCACGGATTCTTTGGACATGGAGACGCCGCGGTTTGCCGCGATGCAAGTTAGCTCTGTGTGA

Protein sequence:

>DPOGS206079-PA
MAEWKHLITEWFNEYAVLRENEIKSFAAEHEHNHEIATAIFNLLYSEDESENQVKGQRNEVDQQMLENVCIQLFSFYRSKEVELQRFTLQFVPTLIYTYLSNVAQGNSKACRCIETLLIGLYNFEVVDENGKPKVVSFRLPSLAQASIYHEPLSLGSQFLTESALRRWEECNTKLVRWGPHAQVETINAQNRLKVMAALLFIYNGQLSLLPKLSLRHFCIAASRIVTQGFNKKPGTKSIQRIPVSSNFMLEMIEGAYYAMFNEFYTFALQAVKDIDQRAQYELLPDVMLVTSAVINSLMNNPTGQPCDGPMGISVALSPATTTVTMSKSMITNASFRTKKLPDDIPIQAGQAVSTDSAEMLTSITEEGEDTPMQRGAVRSSKPRHGGLLSKKKDIKDKTNDKKTPTQKGIWNSLSGGGDMVDANADGFEANGSVKDEISMTSVQNSSENSDSRSQVTTDSLDMETPRFAAMQVSSV-
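
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the transcript is a complete coding sequence translated with the standard genetic code, and that the trailing "-" in the protein sequence marks the stop codon. The function names translate and lengths_consistent are hypothetical helpers introduced here for illustration.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: the record's transcript uses this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" by default, matching
    the way the protein sequence in this record ends).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """Check that a CDS length equals three bases per residue plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First six and last seven codons of DPOGS206079-TA, copied from the record.
    print(translate("ATGGCGGAATGGAAGCAT"))
    print(translate("ATGCAAGTTAGCTCTGTGTGA"))
    # Lengths reported in the record: 1431 bp transcript, 476 aa protein.
    print(lengths_consistent(1431, 476))
```

Run on the fragments shown, the sketch reproduces the start (MAEWKH) and end (MQVSSV-) of DPOGS206079-PA, and the reported lengths agree: 3 x (476 + 1) = 1431.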