Monarch geneset OGS2.0

DPOGS202073
TranscriptDPOGS202073-TA1371 bp
ProteinDPOGS202073-PA456 aa
Genomic positionDPSCF300053 + 1199100-1202351
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0053660.071.40% 
BombyxBGIBMGA012486-TA0.067.18% 
DrosophilaCyt-b5-r-PA3e-9141.27% 
EBI UniRef50UniRef50_D6WVJ22e-10146.23%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WVJ2_TRICA
NCBI RefSeqXP_001648852.12e-10745.94%hypothetical protein AaeL_AAEL004278 [Aedes aegypti]
NCBI nr blastpgi|1571053994e-10645.94%hypothetical protein AaeL_AAEL004278 [Aedes aegypti]
NCBI nr blastxgi|1571053994e-10645.94%hypothetical protein AaeL_AAEL004278 [Aedes aegypti]
Group
Gene OntologyGO:00200378.3e-18heme binding
GO:00066294.2e-12lipid metabolic process
KEGG pathway 
InterPro domain[31-134] IPR0011998.3e-18Cytochrome b5
[175-426] IPR0058044.2e-12Fatty acid desaturase, type 1
Orthology groupMCL11192 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202073-TA
ATGCCACCAAATGTAAAATACTTAGACGTAGCTTTTCAAAGGGAGGTTGAAAAGAAAACACACGTCAGCTTTCCTCAGTTGAAGTACCCATCGTTGCGAGATGAGGGGCTGCGAGATCCGATACAATGGCTTATGGGGAAGGCCATGGATGATGGCGCAGAAGGGTTGTGGAGGGTTCACAATGGGTTGTATGATTTGGAGGAGTTTATTCAAAGACACCCCGGAGGCGCGGAGTGGCTGGAGCTTACTAAGGGCACAGATATAACAGAAGCATTTGAATGTCACCACATAGGTCCGCTCGCCGAAAAAATGTTGAAAAATTATTATGTAAGAGACGCAAAAACAGCTAGGAATTCACCGTTTACGTTTAAAGAAGATGGCTTCTATCGTACTCTGAAAAGGACTGTAAGAGATGAAATAGAAAAATTACCAACAAATCTATCAAATCATACGGATATGATAATGGACGGGCTTCTTGTAACATGTCTTGTGGCCTCGGCGCTGTCATGTTGGGCGACTAATTACTGGCTTGTAATGGGATCTTATATTGTAGCATCTGTTTCACTGGGATGGGCAGTTATTGCTGCACATAATTACCTACATAGACGAACTAATTGGAGAATGTACATATTTAATCTTAGTTTATGGTCTTATAGGGATTTTAGAGTCTCCCATGCATTGTCACATCATCTTTATCCTAACACATTAATGGATTTAGAAGTAAGTGCTTTGGAGCCGTTGGTATATTGGAACCCAATGAGAAACAAGCCTTTATGGGCTTATTTTGCTATCGTTATTGAACAACTTCTATTCCCCTTCATGTTTATTCTAAGCTTTTGCAAAAGGATGTCATTAATTTTCTTAAGGAAAGATTTCTTCGAGAAACATATCCGTTGGCATGACGGTGTCGGTCTACTCTTGCCCCTCTGGATGTATCTAGCTAGCGGCGCTAATTTACACACAGTTATGGTTAATTGGATCTGGATCGTTTGTAGTGCGAGTTTCATTTTCTATACAACCGGGTCAAATGCAGCACATCACCATCCCCAAATATTTAAAGATGGCGATGAAGTTAGTGATGTTACGCCTGATTGGGGTATGCATGAGCTAGAAGCTGTAATGGATCGCCATGAAGTAAATAGCAGTTGCTTTAGAGTCTTGGTAATGTTTGGTCACCACGCCTTGCACCACCTATTTCCAACTCTTGATCACGCAGTTTTGGAACATCTTTACCCTGTATTTTTGGAACATTGTGAAAAATTCAAAGCAAACTTTAGGTATATGTCCCAATTCGAACTTTTCATCGGTCAGATAAAACAATCTGTCAAAACAAAACCAACACTCCTATCAGAAAAGAAACGTGCCTTTTGA

Protein sequence:

>DPOGS202073-PA
MPPNVKYLDVAFQREVEKKTHVSFPQLKYPSLRDEGLRDPIQWLMGKAMDDGAEGLWRVHNGLYDLEEFIQRHPGGAEWLELTKGTDITEAFECHHIGPLAEKMLKNYYVRDAKTARNSPFTFKEDGFYRTLKRTVRDEIEKLPTNLSNHTDMIMDGLLVTCLVASALSCWATNYWLVMGSYIVASVSLGWAVIAAHNYLHRRTNWRMYIFNLSLWSYRDFRVSHALSHHLYPNTLMDLEVSALEPLVYWNPMRNKPLWAYFAIVIEQLLFPFMFILSFCKRMSLIFLRKDFFEKHIRWHDGVGLLLPLWMYLASGANLHTVMVNWIWIVCSASFIFYTTGSNAAHHHPQIFKDGDEVSDVTPDWGMHELEAVMDRHEVNSSCFRVLVMFGHHALHHLFPTLDHAVLEHLYPVFLEHCEKFKANFRYMSQFELFIGQIKQSVKTKPTLLSEKKRAF-