Monarch geneset OGS2.0

DPOGS212052
Transcript: DPOGS212052-TA, 1080 bp
Protein: DPOGS212052-PA, 359 aa
Genomic position: DPSCF300054 + 860797-862031
RNAseq coverage: 336x (Rank: top 34%)
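
The lengths in the header can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
# Values copied from the record header.
TRANSCRIPT_BP = 1080
PROTEIN_AA = 359
GENOMIC_START, GENOMIC_END = 860797, 862031


def codon_count(transcript_bp):
    """Return the number of complete codons in a coding transcript."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of three")
    return transcript_bp // 3


def genomic_span(start, end):
    """Return the span length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


codons = codon_count(TRANSCRIPT_BP)
span = genomic_span(GENOMIC_START, GENOMIC_END)

# One codon is the stop codon, so the protein has one residue fewer.
print("codons:", codons, "| residues + stop matches:", codons - 1 == PROTEIN_AA)
# Genomic bases not present in the transcript (presumably intronic).
print("genomic span:", span, "| extra bases:", span - TRANSCRIPT_BP)
```

Under that coordinate assumption, the genomic span is longer than the transcript, which would be consistent with one or more introns, although the record itself does not list exon boundaries.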
Annotation
Heliconius       HMEL013595         8e-112   84.76%
Bombyx           BGIBMGA010200-TA   3e-152   61.90%
Drosophila       CG1291-PA          4e-73    57.84%
EBI UniRef50     UniRef50_B7PS87    3e-86    40.79%   AHPC/TSA protein, putative n=1 Tax=Ixodes scapularis RepID=B7PS87_IXOSC
NCBI RefSeq      XP_001655818.1     3e-102   47.50%   alpha-1,3-mannosyltransferase [Aedes aegypti]
NCBI nr blastp   gi|312372492       5e-105   46.78%   hypothetical protein AND_20108 [Anopheles darlingi]
NCBI nr blastx   gi|312372492       2e-101   46.90%   hypothetical protein AND_20108 [Anopheles darlingi]
Group
Gene Ontology    GO:0009058            2.1e-25   biosynthetic process
KEGG pathway     aag:AaeL_AAEL012034   9e-102
                 K03843 (ALG2) maps-> N-Glycan biosynthesis
InterPro domain  [212-321] IPR001296   2.1e-25   Glycosyl transferase, family 1
Orthology group  MCL13937              Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212052-TA
ATGGTTAAAATAATATTTCTCCATCCCGATTTGGGTGTTGGGGGAGCAGAGCGTTTAGTTTTGGATGCCGGCTTGGCCTTCAAAAGCAAAGGCCACGATGTAATATATTACACAAATCATCACGATCCATCACATTGTTTCGCCGAAACTAGAAATGGGACATTTCCTGTGAATGTAGTTGGGGATTGGATCCCACGTTCTATATTTGGAAGATTCAAGGCCGCGTGTGCTTATGCACGTATGGTTTTTGCAGCTATTTATTTAGCATGGTACGTGATTCCGGCTGAAGAGCCAACTTTAATTTTCTGTGATTTGATTTCATTATGTATTCCATTCTTGAAGTTGGCCCGTGGGCCTCATAGAATAGTTTTTTATTGTCATCACCCTGATAAGCTATTGTCTGCTGAAGGAGGTTTCCTAAAGAAGTTATACAGAGCTCCACTTAACTGGTTAGAGGAATTGACAACTGCTAGGGCAGATAAAGTTTTGGTTAACAGCAAATATACAGCCAGGGTTTATAAAGATGCATTTCAAAAAATTAAAGATATTCCCGACATTTGTTACCCCTCTATCAATACAGAGTTTTTTAAATCTGCTGTGCCAAAAGCCATAAAAGAAATATTGCCGATAGAACTAACCGACTTAACGGCGGAACTTGATTTGGAAGAAAAAGTGACATTAATGAAATCTCCAAGAGATCTAGAAAAGGTATCTCTGTTGTACAACTGTAAGGCATTGATTTATACACCATCAAATGAACATTTTGGTATTGTTCCTCTTGAAGCGATGTATTATTCCAAGCCTGTCATAGCGGTGAACAGTGGCGGACCGACGGAAACTATTGTTAATGAAGTCACAGGATTCCTATGTGAACCAACGAGTGAATCCTTTGCGAAAGCTATGTGTACTTTAATGACTGATCCTGAACTATGCAGGAAGTTGGGTGAAGCTGGGAGGAAGAGATTTGATACAAAATTTTCCTTTGAGGCCTTCACAAATCAGATAGAGGGAATATTAACAAGGGAAAGACAAGTTATATCTGAGGCTCGTGCGATTGAATATGAGAAGAAAAATAAATAG

Protein sequence:

>DPOGS212052-PA
MVKIIFLHPDLGVGGAERLVLDAGLAFKSKGHDVIYYTNHHDPSHCFAETRNGTFPVNVVGDWIPRSIFGRFKAACAYARMVFAAIYLAWYVIPAEEPTLIFCDLISLCIPFLKLARGPHRIVFYCHHPDKLLSAEGGFLKKLYRAPLNWLEELTTARADKVLVNSKYTARVYKDAFQKIKDIPDICYPSINTEFFKSAVPKAIKEILPIELTDLTAELDLEEKVTLMKSPRDLEKVSLLYNCKALIYTPSNEHFGIVPLEAMYYSKPVIAVNSGGPTETIVNEVTGFLCEPTSESFAKAMCTLMTDPELCRKLGEAGRKRFDTKFSFEAFTNQIEGILTRERQVISEARAIEYEKKNK-
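
The protein record should be the conceptual translation of the transcript. A minimal sketch of that check follows, assuming the standard genetic code and writing the stop codon as "-" to match the protein record above. The `translate` helper is hypothetical and is not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop`. Any trailing partial codon is ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


# First ten and last six codons of DPOGS212052-TA, copied from the record.
print(translate("ATGGTTAAAATAATATTTCTCCATCCCGAT"))
print(translate("GAGAAGAAAAATAAATAG"))
```

To check the whole record, pass the full DPOGS212052-TA sequence to `translate` and compare the result with the DPOGS212052-PA sequence, including its trailing "-".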