Monarch geneset OGS2.0

DPOGS210450
Transcript        DPOGS210450-TA    1122 bp
Protein           DPOGS210450-PA    373 aa
Genomic position  DPSCF300062 + 111581-119237
RNAseq coverage   1x (Rank: top 94%)
Annotation
Heliconius      HMEL004041        2e-28  47.20%
Bombyx          BGIBMGA011436-TA  1e-06  26.50%
Drosophila      CG5783-PA         3e-20  27.45%
EBI UniRef50    UniRef50_B0XIX8   3e-18  28.70%  Putative uncharacterized protein n=2 Tax=Culex quinquefasciatus RepID=B0XIX8_CULQU
NCBI RefSeq     XP_002065604.1    1e-20  28.92%  GK15541 [Drosophila willistoni]
NCBI nr blastp  gi|195435229      3e-19  28.92%  GK15541 [Drosophila willistoni]
NCBI nr blastx  gi|195435229      2e-19  28.92%  GK15541 [Drosophila willistoni]
Group
Gene Ontology    GO:0016747  1.5e-17  transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathway
InterPro domain  [236-360]  IPR016181  1.1e-32  Acyl-CoA N-acyltransferase
                 [286-367]  IPR013653  1.5e-17  FR47-like
Orthology group  MCL17250  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210450-TA
ATGGTGCATCGCTTAAAACTAGTGCCTATTCAAAAATGGCCTGAAATAAGATCTTTATTAAAATTATATCTTCCCAGAAGTATAGCTGGGCAAAATTTTTTGAAGACCAGGGAAGAAATTGAAAAGATTGGTTATGGGTATAAAGCCGAAGTGTACTGCCCAGACGGTGATGCCTCTAATGGAATTGTTGCCCTTAACGTTAAGGACAAGCTCTGCGAGGTTAATATTCAGTGCCCTAAATATGATACCGGAAAACTGGAAGAAGCTTTGAGAACAACAGAAGTAATAGACTGGACCAAATGTGTAAAATTGATATACGCTCAAAAGCATGTGATGCAATGTATGATGAAAGCAATAAGGGATAAAAATATTGCCATAAAAGAAGTCATACCTTCAGTGACATTTGTCAAGTATAACAACGATCCACTTTTTGACGTAAGTTTACCAAAAGGATATAGTTTTGAGGCGTTAACTCTAAAGTATGTAGATATGAATTCCTTTTACGAAGTGATTATCCAATGCCCGAATAATGACACCACAGAACTTGAAGAAGCTTTGAAAACAACAAAAGTTATAGACTGGGCGCGAAAACTTGAAGTTCCGTTCGCACCTAAAAATGTACGAGACTGTATGGAAAGAATCATAAATGAAAGAAATTACACATTACAGTACATTGATATTACAGACACGTTTATACTCAAGAGAAACGCAACACCATTCAATATGAGACTAGCCCCGGAACTGTCCTTCAAACTTCTTACTTTACATTACAAGGATACGGTTAATAACGCATGGCCGCACAAATACCCGGGATCTGATTGGTATTTTGAATTACTAATAAAAGCCAATTTAGGCTACGGCCTGTTTAAAGGAGACGAGCTAATTTCGTGGGTTTTCATTAAAGAAATGGGAGCGCTCGGACATCTCTACACTTTGGAGGAGCATAGAAGGAAAGGTTACGGAGAATTAGTTTTAAAACTCATATCAAATGTATTACTGAATGAGGGGAAATACGTCGTAGCTTTTTGCATCAAAGGTAATGAGAATGCATGCAAGCTGTATAAAAAACTGAATTTCGAGAACGTCCAAGTCGTTTATTGGTGCAATTTTATAGGTAATTAA

Protein sequence:

>DPOGS210450-PA
MVHRLKLVPIQKWPEIRSLLKLYLPRSIAGQNFLKTREEIEKIGYGYKAEVYCPDGDASNGIVALNVKDKLCEVNIQCPKYDTGKLEEALRTTEVIDWTKCVKLIYAQKHVMQCMMKAIRDKNIAIKEVIPSVTFVKYNNDPLFDVSLPKGYSFEALTLKYVDMNSFYEVIIQCPNNDTTELEEALKTTKVIDWARKLEVPFAPKNVRDCMERIINERNYTLQYIDITDTFILKRNATPFNMRLAPELSFKLLTLHYKDTVNNAWPHKYPGSDWYFELLIKANLGYGLFKGDELISWVFIKEMGALGHLYTLEEHRRKGYGELVLKLISNVLLNEGKYVVAFCIKGNENACKLYKKLNFENVQVVYWCNFIGN-
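
The listed lengths are consistent with a complete coding sequence: 1122 bp corresponds to 373 codons plus one stop codon. A minimal Python sketch of that check follows, assuming the two FASTA records above have been saved as plain text. The function names parse_fasta and coding_length_matches are hypothetical and are not part of any MonarchBase tooling.

```python
def parse_fasta(text):
    """Return {record_id: sequence} for FASTA-formatted text.

    The record id is the first whitespace-delimited token after '>'.
    Sequence lines belonging to one record are concatenated.
    """
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def coding_length_matches(transcript, protein):
    """True if the transcript has three bases per residue plus a stop codon.

    Assumption: a trailing '-' or '*' on the protein marks the stop
    codon and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)
```

Applied to the values in this record, the check reduces to 1122 == 3 * (373 + 1), which holds.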