Monarch geneset OGS2.0

DPOGS202658
TranscriptDPOGS202658-TA1206 bp
ProteinDPOGS202658-PA401 aa
Genomic positionDPSCF300039 - 102667-107972
RNAseq coverage113x (Rank: top 59%)
Annotation
HeliconiusHMEL0022312e-13578.28% 
BombyxBGIBMGA000856-TA3e-15173.55% 
DrosophilaCG16959-PA7e-9043.29% 
EBI UniRef50UniRef50_E2B0732e-9644.03%Putative uncharacterized protein n=6 Tax=Formicidae RepID=E2B073_CAMFO
NCBI RefSeqXP_001605624.11e-9845.27%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3320265284e-9945.07%hypothetical protein G5I_04807 [Acromyrmex echinatior]
NCBI nr blastxgi|3320265286e-11145.31%hypothetical protein G5I_04807 [Acromyrmex echinatior]
Group
KEGG pathway 
Orthology groupMCL12886 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202658-TA
ATGGCTGTGTGTTTGTGGTTGGCGATATTGTTTTATGGAAGTCATGCTGCTGTCCTTCCGCCTCCTTGGGCCGATGTCAGAAGAAACCCCTGTGCAGCTCATCCTCAAGGTTGGCTGATGCTGTACTGGCCTTCAGATGGAAAATGTTACACTATTTACAAGAAAGGTCACCCATGTCCAGATACTCAAGAGCTAAGCCCTGGCAGACTGGGCGGGAGGACGGTGGCCGAGTGCAAGTGTCCGCCAGGAACAGCCCAGCTGCCCCACACACACACCTGCCACAAGCTCTTCGAAAGAGGTCCCTGCAAACACGGAGAATATTTCGCACCTGTCGAAGAGTCTTTTAACATGCGAGGTGAACGTCAGGGTATGTGTGTGAGACCTCAGCAGTGCGCGGACGAGACCTTACTCTACTGGCCCCCCGATGGAAGATGCTACTACAGACTCACCCAGGGTCCGTGCTATCCCGGCCTCATCCTGGATGTGGGCAGCAACGGTCTGGCGAATTGTAGTTGTTCAAGGTCCAGCCCTCACTACTACCAGGGCTCGTGTTACGCTCACTACACCCGTGGTCCTTGCGAGCTCGGTCAACTGTTCCTGCCTGATGGAAGGTGCGGCTGTGAGGAACATCTGCCTCACTACAGCCGAGACGCTCAAGGATGTTTTGAACTCGACACGCTGGGGCCGTGTTCCAAGGGTCACGTATTCTCAATATCAGAAGTATCAGTTAATGGGTCACGCGCTGAGTGTCATTGTAAACGGTTCCACGCGCGAGCTGCAGATGGAGCCTGCTACAGACTGTACACGAGGGGTCCCTGCGACCAGGACGAGATGATCAACAGGGGGGGGAGGTGTACTAAGGTGCCGTGTGCTCGCGGTCGTCTGTACGTCCCGTCTCGCCGTCGCTGCTACCGTCCAGGGTCCGCGGAGCCGTGCGGTGTGGGCGAACACCTCGCCTTCGACTTCGAAGCGCGCCCCGCCCTCGACGGACTCAGTCATAACGGCGTGTGCGTGTGCGGGAACAGGCAGTGCGCTTCTGAACAGGTACAGTCATGTCTAACACGTAAGGGTGCTGCTATATACAAAGGTACGTGTCACGTTTTGTATACACAGGGCCCTTGTTCCGAGGGTCGCTGGCTCGCCATGGATGGCACCGGAGTGAGCTGCCAGTGTAGACCCGGCCTAACTTATTGCAAGGACAGCTGA

Protein sequence:

>DPOGS202658-PA
MAVCLWLAILFYGSHAAVLPPPWADVRRNPCAAHPQGWLMLYWPSDGKCYTIYKKGHPCPDTQELSPGRLGGRTVAECKCPPGTAQLPHTHTCHKLFERGPCKHGEYFAPVEESFNMRGERQGMCVRPQQCADETLLYWPPDGRCYYRLTQGPCYPGLILDVGSNGLANCSCSRSSPHYYQGSCYAHYTRGPCELGQLFLPDGRCGCEEHLPHYSRDAQGCFELDTLGPCSKGHVFSISEVSVNGSRAECHCKRFHARAADGACYRLYTRGPCDQDEMINRGGRCTKVPCARGRLYVPSRRRCYRPGSAEPCGVGEHLAFDFEARPALDGLSHNGVCVCGNRQCASEQVQSCLTRKGAAIYKGTCHVLYTQGPCSEGRWLAMDGTGVSCQCRPGLTYCKDS-