Monarch geneset OGS2.0

DPOGS201044
TranscriptDPOGS201044-TA1338 bp
ProteinDPOGS201044-PA445 aa
Genomic positionDPSCF300299 + 52598-57845
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0053641e-11569.12% 
BombyxBGIBMGA012487-TA8e-17161.42% 
DrosophilaCG17928-PA1e-8739.77% 
EBI UniRef50UniRef50_D6WVJ22e-10646.21%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WVJ2_TRICA
NCBI RefSeqXP_975884.14e-10746.21%PREDICTED: similar to CG17928 CG17928-PA isoform 2 [Tribolium castaneum]
NCBI nr blastpgi|910881318e-10646.21%PREDICTED: similar to CG17928 CG17928-PA isoform 2 [Tribolium castaneum]
NCBI nr blastxgi|910881317e-10746.32%PREDICTED: similar to CG17928 CG17928-PA isoform 2 [Tribolium castaneum]
Group
Gene OntologyGO:00066291.5e-18lipid metabolic process
GO:00200373e-18heme binding
KEGG pathway 
InterPro domain[170-410] IPR0058041.5e-18Fatty acid desaturase, type 1
[19-122] IPR0011993e-18Cytochrome b5
Orthology groupMCL19867 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201044-TA
ATGGCACCTGATCCAGAAAGACGGCAAGTGAGTTTTCCGAAACTAAAATATCCTTTGTTCAGAGAAGAAGAGCCAAAGAGTCCACAAAAATGGATAAAAGCGAAACAGATTCAAGATGGCGCCGAAGGTTTCTGGAGAATTCATGACAATATTTACGATCTCACCGAATTTATTCCTTCTCATCCTGGGGGCTCGCAGTGGCTTTCAATGACCAAGGGTACAGATATTACCGAAGCCTTTGAAACACATCACATAAACAGTACAGCCGAAGCTCTGTTACCCAAATATTTTATTAAGAAAGCCGACACCCCACGGAATTCACCCTTTACGTTTAAGGAAGATGGGTTTTATAAGACATTGAAAGCAAAAGTCGTGTTAAAATTAAAAGACATACCAGGCGACGTAAGGAAAAAGAGTGACAATGTAACAGATTTTCTCTTTGTATGCCTCGTGGTAGCTGGTCCGCTTTGTTGTTGGCTTTGGACAAAGAATTTAATATATGGAGCGGCCGCCACGCTGATTCTCGGTCTAACACTTTGTGCCTTAACAATCTGTGCCCACAACTATTTTCATAGAGCAGATAGCTGGCGGATGTACCTTTTCAATATAAGCGGCTTCTCATACCTTGATTGGCGGATTTCACACTCGATGTCTCATCACTTATACACAAACACAGCAAACGATATAGAATTGAGCTTCTTAGAGCCTTTCTTGCAATACTTGCCGAGGCCGGATAAGCCATTATGGGCCCAAATGGGGGCTTTCTTTTACCCCGTCGTATTTTTATTCACGTCACTCGGATGCATGATTAAAGAATTTGTTGCGGGAATATTAAAATTGGATGATAAAAAATTAACTTTGGCAAATGCCATACCTTTCGTATTGCCGGTATGGATGTGGTATATCAGTGGACTGTTTTTACCATGGACTCTGCTGGTTTGGCTGGCTACTACGATGATATCAAGTCTATTCTTCATGATATTCGGTCTCACTGCTGGACACCACGCTCATACAAACTTCTTCGAAGGAGACGTACCGAGAGAGGAGACACTCGATTGGGGAATCCATCAACTTGATTCAATAATAGAAAGGGTTGACTACGCGGGAGACCATTTCAAATCGCTCACTCGTTTCGGAGATCATGCCCTTCACCACCTGTTCCCAACGTTAGATCACGCCGAATTGAAGTACCTGTACCCCACATTATTAGAGCACTGTGAGAAATTTGAAACACAGCTTCGAACTACCACCTTCTACAACGCATTGATAAGCCAAAGCAAACAATTAATAAGGAAACGACCTAATAACTTCAAGCAAAAAGTGAAGCAAAGTAGTTAA

Protein sequence:

>DPOGS201044-PA
MAPDPERRQVSFPKLKYPLFREEEPKSPQKWIKAKQIQDGAEGFWRIHDNIYDLTEFIPSHPGGSQWLSMTKGTDITEAFETHHINSTAEALLPKYFIKKADTPRNSPFTFKEDGFYKTLKAKVVLKLKDIPGDVRKKSDNVTDFLFVCLVVAGPLCCWLWTKNLIYGAAATLILGLTLCALTICAHNYFHRADSWRMYLFNISGFSYLDWRISHSMSHHLYTNTANDIELSFLEPFLQYLPRPDKPLWAQMGAFFYPVVFLFTSLGCMIKEFVAGILKLDDKKLTLANAIPFVLPVWMWYISGLFLPWTLLVWLATTMISSLFFMIFGLTAGHHAHTNFFEGDVPREETLDWGIHQLDSIIERVDYAGDHFKSLTRFGDHALHHLFPTLDHAELKYLYPTLLEHCEKFETQLRTTTFYNALISQSKQLIRKRPNNFKQKVKQSS-