Monarch geneset OGS2.0

DPOGS209064
TranscriptDPOGS209064-TA1911 bp
ProteinDPOGS209064-PA636 aa
Genomic positionDPSCF300102 + 270606-278139
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0052736e-7080.49% 
BombyxBGIBMGA010056-TA6e-8083.57% 
DrosophilaCG11693-PB1e-2451.09% 
EBI UniRef50UniRef50_D6W7833e-4056.50%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W783_TRICA
NCBI RefSeqXP_973262.16e-4156.50%PREDICTED: similar to CG33257 CG33257-PA [Tribolium castaneum]
NCBI nr blastpgi|910827491e-3956.50%PREDICTED: similar to CG33257 CG33257-PA [Tribolium castaneum]
NCBI nr blastxgi|2700150761e-6057.96%hypothetical protein TcasGA2_TC014239 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[340-519] IPR0079992.6e-39Protein of unknown function DUF745
Orthology groupMCL16536 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209064-TA
ATGAGGGCCTTCACGATCGTCTTGCTTTGCGTCGGAGCGCTCGCCTCCGCATCATCAGCCCCAATCAATCAAAACTCGGCTCAAAGAGCAGAATACAACTATGACTACGAAGGCGATCAAACAAACCAAGCCAAAGACCCAGCGGAAGCTGCAAGCCATTACGACTCTTATATCCAGGACGTCAAGGCAGCAGCTAACGGGGGCGGCGGGAAAAAGGGCTTCAGCAGAGGCAGTGGCTTACGAACGATAGCCGTGGGTTCAGCTAATCAAGCAAAAACAGCGCTCGGAAACCAGCAGGCCGCGGCATACCAAGCGGCGTATGTAGCTAAGAATACTTTAGCGCAATCCGCTGCCCAGTCGAGTGCCACAGCGCAGGCAGCGCTGGCCGGGAAGCAGGTTATACTGTCTGGTCTGGAACAGCAAGTGAGGGACGCGAAGGTTGGATTACAAGGAGAGGAGATGCAACTCCAGCAGGCGAAGAGAGCGGCGCAGTCAGCAGCACAAGCAGCTCAACAAGCGATGCACCAGGTGAACGTGATCCAAGCAGCGCTAAACGCAGCTCAGGCTACATCAGAAAATGCGAACGAGGCAGCGTCTCAAGCTGCGGGGGAGCTCGGAGCTCAGACAGCCATGGTGGGTGCAGCGCGACAGAGACTGTCTACTCTTCAGGAACAACTCAAAGGAGTCAGAATCGACTTTGAAGCCACTCAAGCTGCAGCTAGGAAGGCTCAGGCGGCTGCTCAACAAGCTCAAGCGAATGCCGCTGAAGCTGCAGCGAAGGCCGCCGCGGCTGGCCTGGCAGCTAACAAGCCCGACGCCTCACACGAAGGAGCGCTCGCCTCCGCATCATCAGCCCCAATCAATCAAAACTCGGCTCAAAGAGCAGAATACAACTATGACTACGAAGGCGATCAAACAAACCAAGCCAAAGACCCAGCGGAAGCTGCAAGCCATTACGACTCTTATATCCAGGACGTCAAGGCAGCGGCTAACGGGGGCGGCGGGAAAAAGGGCTTCAGCAGAGGCAGTGGCTTACGAACGATAGCCGTGGGTTCAGCTAATCAAGCAAAAACAGCGCTCGGAAACCAGCAGGCCGCGGCATACCAAGCGGCGTATGTAGCTAAGAATACTTTAGCGCAATCCGCTGCCCAGTCGAGTGCCACAGCGCAGGCAGCGCTGGCCGGGAAGCAGGTTATACTGTCTGGTCTGGAACAGCAAGTGAGGGACGCGAAGGTTGGATTACAAGGAGAGGAGATGCAACTCCAGCAGGCGAAGAGAGCGGCGCAGTCAGCAGCACAAGCAGCTCAACAAGCGATGCACCAGGTGAACGTGATCCAAGCAGCGCTAAACGCAGCTCAGGCTACATCAGAAAATGCGAACGAGGCAGCGTCTCAAGCTGCGGGGGAGCTCGGAGCTCAGACAGCCATGGTGGGTGCAGCGCGACAGAGACTGTCTACTCTTCAGGAACAACTCAAAGGAGTCAGAATCGACTTTGAAGCCACTCAAGCTGCAGCTAGGAAGGCTCAGGCGGCTGCTCAACAAGCTCAAGCGAATGCCGCTGAAGCTGCAGCGAAGGCCGCCGCGGCTGGCCTGGCAGCTAACAAGCCCGACGCCTCACACGAAGGATCCGGTCCCAACCAGGATCCAGTCACATTGCACGCAACCCGAAGATCCAAACCCATCCAAAACATCAAAACCCTCACACTCAACCTCTCACAGAATCCAGGCCAATGGCACAAACAGAAACCACCAGTACAGTTCCATGAACCGTTAAAAATATCAAAAACTAAAACAATAAAGCTGTTAGATTCTGAATCATTTGAAAAGTTCCCAACCTTTGCTGACGGCGAAAATCATGAAGAAGTTGAGGAAGATGTAGAAGAAGAAGGTGAAGAAGATTGTGAGGAATAG

Protein sequence:

>DPOGS209064-PA
MRAFTIVLLCVGALASASSAPINQNSAQRAEYNYDYEGDQTNQAKDPAEAASHYDSYIQDVKAAANGGGGKKGFSRGSGLRTIAVGSANQAKTALGNQQAAAYQAAYVAKNTLAQSAAQSSATAQAALAGKQVILSGLEQQVRDAKVGLQGEEMQLQQAKRAAQSAAQAAQQAMHQVNVIQAALNAAQATSENANEAASQAAGELGAQTAMVGAARQRLSTLQEQLKGVRIDFEATQAAARKAQAAAQQAQANAAEAAAKAAAAGLAANKPDASHEGALASASSAPINQNSAQRAEYNYDYEGDQTNQAKDPAEAASHYDSYIQDVKAAANGGGGKKGFSRGSGLRTIAVGSANQAKTALGNQQAAAYQAAYVAKNTLAQSAAQSSATAQAALAGKQVILSGLEQQVRDAKVGLQGEEMQLQQAKRAAQSAAQAAQQAMHQVNVIQAALNAAQATSENANEAASQAAGELGAQTAMVGAARQRLSTLQEQLKGVRIDFEATQAAARKAQAAAQQAQANAAEAAAKAAAAGLAANKPDASHEGSGPNQDPVTLHATRRSKPIQNIKTLTLNLSQNPGQWHKQKPPVQFHEPLKISKTKTIKLLDSESFEKFPTFADGENHEEVEEDVEEEGEEDCEE-