Monarch geneset OGS2.0

DPOGS203052
Transcript: DPOGS203052-TA, 1494 bp
Protein: DPOGS203052-PA, 497 aa
Genomic position: DPSCF300206 + 74589-76082
RNAseq coverage: 723x (Rank: top 18%)
Annotation
Heliconius: HMEL016144, 6e-144, 64.47%
Bombyx: BGIBMGA006542-TA, 8e-111, 54.03%
Drosophila:
EBI UniRef50:
NCBI RefSeq:
NCBI nr blastp:
NCBI nr blastx: gi|331237617, 4e-12, 30.92%, hypothetical protein PGTG_13265 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Group
KEGG pathway:
Orthology group: MCL15041 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203052-TA
ATGTTCAAGAAAACGTTTTTCGTTCTCACAGTAGTGCTTCTTATTTTTGTGAATGATGTAATAGAAGCAAGAAAATTATCAGGTAGTCGCAGTTCTAGTTCCGGCCGAAGTTCTAGTAAACGAACAAATCCAAAACCAATACCGACATCCTTCAGTTATCCACAATCTTCAGCACCTAAACCATCTTTGTTTGGCTGGCAAGAAAAACCTGCTCAAAAAACTCAAGTGTCGTCTCAAAAATCAAAGCCATCAAATCAAGGACATTCTTATCCATCAAGCAACACAGGATTATCTGGCAATGGGCAACCTAAACAACCTCCCACACAGGAAGGAAGCATGCAGAGAAGCAATGTGCCGTCTCAACATTCTTCACCGAATAAACAAAATGTTAATGAAGCGAGTCATACTTATCCAGCATCCAACAGTCTATCAGGTAACTCTGGAACTAATTCTGGGACTGGATATCCACAAGGAACTGGATTATCGGGTTCAAATTACCCAGCACAGCCTTCATCAAATATAAATACAGCTAATGGATATCCCGTTAACAATGTTGGACCTTACCGTAACACGCAAAATGCACCGCCACCATATTCAAGTGCAGGGAATACAAACTATCATCCCCAAGGGCCACCGCCGCCTTATACTAATGTAGGAAATAGTAATTATCATCATCCTCAACATCCGCCACCACCTTACACTAACTATGGACATAATTATGGAGGTTCTGGTGGATCTTACAGTCCTCAAGCCCCTGGATACTTTGGAAACTACGTTAATCCAGGAAAAAATTATGGAGGTGTGAGTCGATCAGGCAATGTTTTAACAGGAGTCGGAATTGCAGGAGCAGGAATAGGGACTGTTTTAACAGGTTTAGCATTGTGGAATTTAGCAAGATCAACTGGTCATCATCATCATACAGTGATTTATGACAATCGTGGTCAACCGATCGCTGTAGCTCCAGATAACAGTACAACACCGGTCGTGGATCCAATATTAAGTGATTTAGTCAACTGCACACTCACGATTAATAATGGTAATACAACAGAAGTTCTTGCAATACCCTGTTCGATTGCAACATCATTCACTCCGGATGCCAATGTTAAAGATGAGAGTCTAAATAAGGAATCAAGCGATAATACAGAATGTATTATAACTGTGGTTACAAAATCGTCTAAGGAATTTATGACATCCATACCTTGTTCGGTCCTTCTTAATACTGCAGCTGAAAATAATGTAACCGAGGCTCCTATCCTGGATACAAATACGACTATAGAAAATGGTACTTCGGTTTTATCTCCTCAAGAAACCTTAAGTGCTGACCAACCCACGGCTCTACGTCTTTCTTCTTTAGAAGATGAGAACTTCAAAGAACCTAAAACTACATTAAACTGTACTCAAGAACAAGGAGAAATTCGGGATCCCATTAATCCTTGTTTTAGCGTGAAACATAATTTAACTGTCATTCCCTTAGAAACAACTGTTACACAATAG

Protein sequence:

>DPOGS203052-PA
MFKKTFFVLTVVLLIFVNDVIEARKLSGSRSSSSGRSSSKRTNPKPIPTSFSYPQSSAPKPSLFGWQEKPAQKTQVSSQKSKPSNQGHSYPSSNTGLSGNGQPKQPPTQEGSMQRSNVPSQHSSPNKQNVNEASHTYPASNSLSGNSGTNSGTGYPQGTGLSGSNYPAQPSSNINTANGYPVNNVGPYRNTQNAPPPYSSAGNTNYHPQGPPPPYTNVGNSNYHHPQHPPPPYTNYGHNYGGSGGSYSPQAPGYFGNYVNPGKNYGGVSRSGNVLTGVGIAGAGIGTVLTGLALWNLARSTGHHHHTVIYDNRGQPIAVAPDNSTTPVVDPILSDLVNCTLTINNGNTTEVLAIPCSIATSFTPDANVKDESLNKESSDNTECIITVVTKSSKEFMTSIPCSVLLNTAAENNVTEAPILDTNTTIENGTSVLSPQETLSADQPTALRLSSLEDENFKEPKTTLNCTQEQGEIRDPINPCFSVKHNLTVIPLETTVTQ-
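
The lengths and coordinates listed in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that the genomic coordinates are 1-based and inclusive, and that the transcript is entirely coding and ends in a stop codon (the nucleotide sequence above starts with ATG and ends with TAG). The function names are hypothetical.

```python
# Values copied from the record above.
START, END = 74589, 76082   # genomic position on DPSCF300206, + strand
TRANSCRIPT_BP = 1494        # stated transcript length
PROTEIN_AA = 497            # stated protein length


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a coding sequence whose final codon is a stop."""
    if cds_bp % 3 != 0:
        raise ValueError("coding length is not a multiple of 3")
    return cds_bp // 3 - 1


genomic_span = span_length(START, END)
protein_len = expected_protein_length(TRANSCRIPT_BP)

# Both comparisons hold for the values in this record.
print(genomic_span == TRANSCRIPT_BP, protein_len == PROTEIN_AA)
```

Under these assumptions the genomic span equals the transcript length, which is what one would expect if the gene model has a single exon with no introns.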