Monarch geneset OGS2.0

DPOGS201997
TranscriptDPOGS201997-TA1230 bp
ProteinDPOGS201997-PA409 aa
Genomic positionDPSCF300060 + 224427-226411
RNAseq coverage98x (Rank: top 61%)
Annotation
HeliconiusHMEL0026310.077.75% 
BombyxBGIBMGA010405-TA1e-13578.35% 
Drosophilafy-PA2e-4228.97% 
EBI UniRef50UniRef50_E2C6413e-7333.98%Protein fuzzy-like protein n=8 Tax=Formicidae RepID=E2C641_HARSA
NCBI RefSeqXP_001604141.16e-8438.22%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|1565505981e-8238.22%PREDICTED: protein fuzzy homolog [Nasonia vitripennis]
NCBI nr blastxgi|1565505981e-8238.22%PREDICTED: protein fuzzy homolog [Nasonia vitripennis]
Group
KEGG pathwaybfo:BRAFLDRAFT_1194847e-34 
 K12617 (PATL1, PAT1)maps-> RNA degradation
Orthology groupMCL13334 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201997-TA
ATGTCTGTAATAGTGGTTGCTGTTGCCTCAGAGAGTGGTGTTCCAATATTTTCAAGAAAACGCGGTAGTAATGAAAATATTCAATTTTCTACTATAGCGTCACTTCATGGGATAAATATGTTCACAAAATGTCACAACCTCTCTTTAATTAATACACACCTTGATAATGGAACAATAATTTGGAAGGAGTATTGTAAAAGTATTACTTTAATAGGAATAGCTACGGGTGGACTACAATGTGACCTTGAAAATCTATTGGCCTGTATACATGATGTAATGATATTTTGTATTGGCAAAAAAGAGTTGGAAAATTTAAAAAATGTTGACCAAATCAAAAGAGACTTGAGGCAGTGTTATCCAATTCTAGATTACTTGTTGGAATCTCTAGATCCAAATTCATTGCCACAGTCCACAATCATCCTAGATTATATTCAAAGTCTTCTTTGTCCTCAGGCTCAACAGTTGCAGGAAGTTTTGGACAACTATGCGCAGACTGTAACAGGAAGGTGGGCTTGTCTTAGCATTCATGGTCATTTGGTTGCCACAAGTTCTGACTTCTCTGAGTTAGATGCAAGAGAAGCCAAATTACTCTTATTATTAGCAGCGGCTCAAGATGGAGCCCCATTAAGAGATACCTTGGTGTATTTGCCACAAATGAGTCCAAATGTAGCATTCCGGGCTGTAACATGCAAACTTTTAGCAGATGTCTATGTGCTAGTTATCTGTGGAGCGACTCCTCCGCTGTCAGAAATAGATGAAATAGTCCTTCGATGTTGGGAAGGGTATGCACAAACTATAAAGGAGGCAAAACTGACCTATCCCAGGAACTTTCCTACCAGTTTCACATTTGATCCAGCTTTGCTAGGTGTATTAGTGATAAACGTTACTAAGAGACGCTGCGTATTTTCACGTCATTTACATGGATCTAATCAGAAGAGCCGAAGCATGTCAAATGCCCACAGGATTGATATACTAAGAACATTTTTTGTAACATCGGTAAAAGATTTAGTACCAGAATTCAGAAGTAATGAGAGTGAAGAAACAGATGTCTGCCAAAGTATTTTAAGCGAAACATTCTGGTGTTCTGAATACCACAAATGTCATATGCAGAGGTCTGGAAATATATTATGTTGCGGGCTTTATTCCCCGACTGTTCCTACACACACTATGAGGTTGATGACAAGTAATTTGCTACAAGATTTGCTCTCAAACAAAGAAATCTATTGGTAA

Protein sequence:

>DPOGS201997-PA
MSVIVVAVASESGVPIFSRKRGSNENIQFSTIASLHGINMFTKCHNLSLINTHLDNGTIIWKEYCKSITLIGIATGGLQCDLENLLACIHDVMIFCIGKKELENLKNVDQIKRDLRQCYPILDYLLESLDPNSLPQSTIILDYIQSLLCPQAQQLQEVLDNYAQTVTGRWACLSIHGHLVATSSDFSELDAREAKLLLLLAAAQDGAPLRDTLVYLPQMSPNVAFRAVTCKLLADVYVLVICGATPPLSEIDEIVLRCWEGYAQTIKEAKLTYPRNFPTSFTFDPALLGVLVINVTKRRCVFSRHLHGSNQKSRSMSNAHRIDILRTFFVTSVKDLVPEFRSNESEETDVCQSILSETFWCSEYHKCHMQRSGNILCCGLYSPTVPTHTMRLMTSNLLQDLLSNKEIYW-