Monarch geneset OGS2.0

DPOGS201043
TranscriptDPOGS201043-TA1338 bp
ProteinDPOGS201043-PA445 aa
Genomic positionDPSCF300299 + 42158-47529
RNAseq coverage0x (Rank: top 95%)
Annotation
HeliconiusHMEL0053641e-11569.12% 
BombyxBGIBMGA012487-TA8e-17161.42% 
DrosophilaCG17928-PA1e-8739.77% 
EBI UniRef50UniRef50_D6WVJ22e-10646.21%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WVJ2_TRICA
NCBI RefSeqXP_975884.14e-10746.21%PREDICTED: similar to CG17928 CG17928-PA isoform 2 [Tribolium castaneum]
NCBI nr blastpgi|910881318e-10646.21%PREDICTED: similar to CG17928 CG17928-PA isoform 2 [Tribolium castaneum]
NCBI nr blastxgi|910881317e-10746.32%PREDICTED: similar to CG17928 CG17928-PA isoform 2 [Tribolium castaneum]
Group
Gene OntologyGO:00066291.5e-18lipid metabolic process
GO:00200373e-18heme binding
KEGG pathway 
InterPro domain[170-410] IPR0058041.5e-18Fatty acid desaturase, type 1
[19-122] IPR0011993e-18Cytochrome b5
Orthology groupMCL19867 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201043-TA
ATGGCACCTGATCCAGAAAGACGGCAAGTGAGTTTTCCGAAACTAAAATATCCTTTGTTCAGAGAAGAAGAGCCAAAGAGTCCACAAAAATGGATAAAAGCGAAACAGATTCAAGATGGCGCCGAAGGTTTCTGGAGAATTCATGACAATATTTACGATCTCACCGAATTTATTCCTTCTCATCCTGGGGGCTCGCAGTGGCTTTCAATGACCAAGGGTACAGATATTACCGAAGCCTTTGAAACACATCACATAAACAGTACAGCCGAAGCTCTGTTACCCAAATATTTTATTAAGAAAGCCGACACCCCACGGAATTCACCCTTTACGTTTAAGGAAGATGGGTTTTATAAGACATTGAAAGCAAAAGTCGTGTTAAAATTAAAAGACATACCAGGCGACGTAAGGAAAAAGAGTGACAATGTAACAGATTTTCTCTTTGTATGCCTCGTGGTAGCTGGTCCGCTTTGTTGTTGGCTTTGGACAAAGAATTTAATATATGGAGCGGCCGCCACGCTGATTCTCGGTCTAACACTTTGTGCCTTAACAATCTGTGCCCACAACTATTTTCATAGAGCAGATAGCTGGCGGATGTACCTTTTCAATATAAGCGGCTTCTCATACCTTGATTGGCGGATTTCACACTCGATGTCTCATCACTTATACACAAACACAGCAAACGATATAGAATTGAGCTTCTTAGAGCCTTTCTTGCAATACTTACCGAGGCCGGATAAGCCATTATGGGCCCAAATGGGGGCTTTCTTTTACCCCGTCGTATTTTTATTCACGTCACTCGGATGCATGATTAAAGAATTTGTTGCGGGAATATTAAAATTGGATGATAAAAAATTAACTTTGGCAAATGCCATACCTTTCGTATTGCCGGTATGGATGTGGTATATCAGTGGACTGTTTTTACCATGGACTCTGCTGGTTTGGCTGGCTACTACGATGATATCAAGTCTATTCTTCATGATATTCGGTCTCACTGCTGGACACCACGCTCATACAAACTTCTTCGAAGGAGACGTACCGAGAGAGGAGACACTCGATTGGGGAATCCATCAACTTGATTCAATAATAGAAAGGGTTGACTACGCGGGAGACCATTTCAAATCGCTCACTCGTTTCGGAGATCATGCCCTTCACCACCTGTTCCCAACGTTAGATCACGCCGAATTGAAGTACCTGTACCCCACATTATTAGAGCACTGTGAGAAATTTGAAACACAGCTTCGAACTACCACCTTCTACAACGCATTGATAAGCCAAAGCAAACAATTAATAAGGAAACGACCTAATAACTTCAAGCAAAAAGTGAAGCAAAGTAGTTAA

Protein sequence:

>DPOGS201043-PA
MAPDPERRQVSFPKLKYPLFREEEPKSPQKWIKAKQIQDGAEGFWRIHDNIYDLTEFIPSHPGGSQWLSMTKGTDITEAFETHHINSTAEALLPKYFIKKADTPRNSPFTFKEDGFYKTLKAKVVLKLKDIPGDVRKKSDNVTDFLFVCLVVAGPLCCWLWTKNLIYGAAATLILGLTLCALTICAHNYFHRADSWRMYLFNISGFSYLDWRISHSMSHHLYTNTANDIELSFLEPFLQYLPRPDKPLWAQMGAFFYPVVFLFTSLGCMIKEFVAGILKLDDKKLTLANAIPFVLPVWMWYISGLFLPWTLLVWLATTMISSLFFMIFGLTAGHHAHTNFFEGDVPREETLDWGIHQLDSIIERVDYAGDHFKSLTRFGDHALHHLFPTLDHAELKYLYPTLLEHCEKFETQLRTTTFYNALISQSKQLIRKRPNNFKQKVKQSS-