Monarch geneset OGS2.0

DPOGS214666
TranscriptDPOGS214666-TA1491 bp
ProteinDPOGS214666-PA496 aa
Genomic positionDPSCF300321 - 12637-17029
RNAseq coverage127x (Rank: top 57%)
Annotation
Heliconius% 
BombyxBGIBMGA001878-TA9e-16254.27% 
DrosophilaCG15651-PA2e-5427.17% 
EBI UniRef50UniRef50_E2BE145e-10240.97%Fukutin-related protein n=5 Tax=Formicidae RepID=E2BE14_HARSA
NCBI RefSeqXP_972129.23e-10941.37%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastpgi|1892396416e-10841.37%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastxgi|1892396414e-10441.11%PREDICTED: similar to predicted protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL14822 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214666-TA
ATGTTACATCGTCTATTTCGCGGTCGCTTTACATTCAAAGTATATCTACTTTTGGGAATAATTGCCATTATCGTATGCGGCTATTTACTCCCACAGTATAAAATATATCCCATAGCACCCGTAACATCACCCAGTTCTGCTCACTCACACGCTTCACGACATATATCAAAATTAGTGACGGTTGTGTTTCGACAATTCGAAGATTTTGAAAATGATATCGCTGAAGGTGTACAATCCTTTGTGTCAGCGTACCCGAATATTGCCATAATAGTAATATGTCAAAGAACGCAGTATCCCCCCTTTCAATTTTCCGGTACAAATGAAACGTTGAAAAATGTGAAAATATTATCAATGGAGCTTAAACTCAATAGTTCTCCGCGAGATTTGGATCCACTTTCTTATATATCTACCGAGTATGTACTGATAGTACCTGATTCGTCGCGTGTATCACGGCGGGTGTTCCAACAAATGACCGTGGCGGCAACAACATACCCAACTCAAGCTATTGCCATAGCAGTTGGTAACGCTCGTCTTAGTTGCCAGCAAATTAAGTGGGCATATGACGACTGGACACTGCAATACAGCAAAGAATCTTCTAAAAAGCTTTGTGATGCAGTGCAGGGACAACACGCACTCATGATTAAAACTTCAGTATTACACACCCTGCCGAAACCGTTCTCATTCCCTTTCCCAGAGTCTTTATATTTGCAGACTGCAGTAAAAAATGTTAAGGTACAAATTTTGGATAGTAAGTTTGCAGCTGGGCGGAGTGTTCTAAAGACTCCAGCTGCTAAGCAGAAGTCATCAAAAAGATTGAGGGACTATCGGACAGCATTGTACAAAGATATTGGTGTGAAAGCTGTTATTAAGGAAGATGGTCGAGTGCAGTGGTTTGGATGTAAGAGGGAAACACAGCGCTGTTTCCCTCCAGTGCAACGTGTACCGTCATACGTGGTGTCCGGTCGCAACACACCGCCTTGCTGTCGCAGGAACATCCGCCGTACAACTGGTCACGTGTTGCGCGCTCTACTTCAGGCCGGAGCGAGATGTTGGTTGGAGACAACCTCACTATTAGGTGCTGTTGTCAATGGGGATTTACTACCATGGGCGGAGTATGCAGAAATAGGGATCCATGCATCAGATCTGTCAAGAGTGTCGTGGCTGCAGAGAGGTGGCGCTGACAATGATGGTTTTGTTTGGGAACGGGCAACCAAAGGGCATTACTACAGAGTAGCCTATTCAGCAACAAATAGAGTATATGTTCTTATACTGCCCTTTACTGCCAAAAATGGGACCATGTGGCCGGCGGATTGGGTCCTATCACACCAACGGGACTTCCCGGAAAGACATTTGCATCCTCTAGCACAGATTCAATTTGTTGGACGTCAGGCTCCGGCCCCGAACGATGCTCGCGCATTTTTGGACCTCAAACTAGGTCCAAACGCTGTCGAAAGAAGCGAGAAGATTGGGCCAAGACTTCTATACCCATAG

Protein sequence:

>DPOGS214666-PA
MLHRLFRGRFTFKVYLLLGIIAIIVCGYLLPQYKIYPIAPVTSPSSAHSHASRHISKLVTVVFRQFEDFENDIAEGVQSFVSAYPNIAIIVICQRTQYPPFQFSGTNETLKNVKILSMELKLNSSPRDLDPLSYISTEYVLIVPDSSRVSRRVFQQMTVAATTYPTQAIAIAVGNARLSCQQIKWAYDDWTLQYSKESSKKLCDAVQGQHALMIKTSVLHTLPKPFSFPFPESLYLQTAVKNVKVQILDSKFAAGRSVLKTPAAKQKSSKRLRDYRTALYKDIGVKAVIKEDGRVQWFGCKRETQRCFPPVQRVPSYVVSGRNTPPCCRRNIRRTTGHVLRALLQAGARCWLETTSLLGAVVNGDLLPWAEYAEIGIHASDLSRVSWLQRGGADNDGFVWERATKGHYYRVAYSATNRVYVLILPFTAKNGTMWPADWVLSHQRDFPERHLHPLAQIQFVGRQAPAPNDARAFLDLKLGPNAVERSEKIGPRLLYP-