Monarch geneset OGS2.0

DPOGS207497
TranscriptDPOGS207497-TA1536 bp
ProteinDPOGS207497-PA511 aa
Genomic positionDPSCF300051 + 800773-808058
RNAseq coverage636x (Rank: top 20%)
Annotation
HeliconiusHMEL0123303e-14281.79% 
BombyxBGIBMGA009839-TA0.070.24% 
Drosophilahfw-PA2e-8140.58% 
EBI UniRef50UniRef50_D6WG701e-9341.77%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WG70_TRICA
NCBI RefSeqXP_967806.22e-9441.77%PREDICTED: similar to AGAP000601-PA [Tribolium castaneum]
NCBI nr blastpgi|1892352263e-9341.77%PREDICTED: similar to AGAP000601-PA [Tribolium castaneum]
NCBI nr blastxgi|1892352261e-9541.99%PREDICTED: similar to AGAP000601-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL14688 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207497-TA
ATGTCGAATTCCGCAGCATCCTTAGTAGCAATATGGTGTCTGGTGGCGCTGATCACGTTCAACACGGGGCTGATAGAAACTAGAATGGAGGAAGTGGACTCCAAGTACAAAAGTGTGGAGCCGCATCTGACTTCCACGGAGAAGCCGAAGCTCGAACCTCACGTCGACCTGAAGATGGGCAAGTGTTTCCACATGTCCCGCGAGCACTGTCCCACGGAGACCAAGTGCAAGAGAGTGGACCCGGTCTCCGTGTTCTGTTGTGATGTAGACAGCACCAGGCTGAAGGAAGCCCTGGCGATCACTGTGACTCAGAACACCGTCAACCTGCACGTGTTGAACGCCACCATCGAGGAGTTGGACGTGTCCCAGTCCATCTTCCGTCGCCTCACATCCATGGCGCTCACTGACGGGAACATCGGGAGGATTGTTGGCCAGTTCCCTAAGTACTCGTCCATCGCGTGCCTGAACATCTCCAACAACAACCTGAGCTCGGCCACGGTGGGCGGTCCGCAGCCGGTTCAGCGCCCCTTCGCCTACCTCTTCACCCTGTCCGTCCTGGACGCCTCCGCCAACAACCTCACAGAGTTCCCCCTCAGCCTCATGCAGAGCAACCGCAAGATATTCCTGGATCTCTCTGGTAACAATTATCTTCCGTGCAAACACTTCCTAATAGCCATGGAGGCCAACAACAGTTCCTTGGTGACCTTCCTCAACTACAATCGAACTTTCTGCGCTCTGGACCTCTTGTTCAACTGGTTCACTGAGCTGCAGATAGTGCCCATTGATCACCTCCGGGTACAGAAAGAGGTGAACGCGAGCTGTCTGAAGATCCGTCCCGCGGAGTCCCGCTGCTCGTGCGCCCCGGACCGCCTGGAGCTGCTGGACGGGGCCGTCACCAACGTGGTGGCCGTGGACTGCTCCCGGAGACACCTGCAGCAGATGCCCACCAACCTGCCGCCTAACACCGTCAAGCTCAACGTCTCCTACAACAATATAACGTCTCTCCAAGCGGTGTCGGACGATCCGTCGTACGAGCACCTGAGGAACCTGTTGGTGGACCACAACGACATCACCAGCATCGTGGACCTGGAGGGGACCAAGTTCATAGACAACTTCATGGTGTTCTCCATCACCAACAACAAGCTGAAGACTATTCATACATACGTGTTGTCAAATCGCTTCGACACCACCGGACCGTCGTTTCTGATCGCGAACAACTACATCCACTGCGACTGCAACACCGAGAAGGTGCTGAAGCCCTGGTTGCTTGAAAACCTGAAAAGCATACCCGACTACAAGAGTCTGGAGTGCGAGGACAACATGGGGCCGGTGGTGGACCTCAAGGAGTTCCAGGTGTGTCACACGCCCAGGGACTGGACGGACTACATCTACTACATCATCGGCCTGGAAGTGCTGGTCCTGGTGCTGTTGATAAGCAAAGTCTCCTACGACTACTGGGTCTTCAAGACCGCCGGCTACCTGCCCTGGCCCGCCAACAAGATGCCGAGGCTGCCCTGTGATTGGTTGTGCGAGTGA

Protein sequence:

>DPOGS207497-PA
MSNSAASLVAIWCLVALITFNTGLIETRMEEVDSKYKSVEPHLTSTEKPKLEPHVDLKMGKCFHMSREHCPTETKCKRVDPVSVFCCDVDSTRLKEALAITVTQNTVNLHVLNATIEELDVSQSIFRRLTSMALTDGNIGRIVGQFPKYSSIACLNISNNNLSSATVGGPQPVQRPFAYLFTLSVLDASANNLTEFPLSLMQSNRKIFLDLSGNNYLPCKHFLIAMEANNSSLVTFLNYNRTFCALDLLFNWFTELQIVPIDHLRVQKEVNASCLKIRPAESRCSCAPDRLELLDGAVTNVVAVDCSRRHLQQMPTNLPPNTVKLNVSYNNITSLQAVSDDPSYEHLRNLLVDHNDITSIVDLEGTKFIDNFMVFSITNNKLKTIHTYVLSNRFDTTGPSFLIANNYIHCDCNTEKVLKPWLLENLKSIPDYKSLECEDNMGPVVDLKEFQVCHTPRDWTDYIYYIIGLEVLVLVLLISKVSYDYWVFKTAGYLPWPANKMPRLPCDWLCE-