Monarch geneset OGS2.0

DPOGS213757
TranscriptDPOGS213757-TA1188 bp
ProteinDPOGS213757-PA395 aa
Genomic positionDPSCF300212 - 377050-378349
RNAseq coverage109x (Rank: top 60%)
Annotation
HeliconiusHMEL0104661e-11863.26% 
BombyxBGIBMGA009243-TA8e-13157.44% 
DrosophilaFucTC-PB2e-4736.59% 
EBI UniRef50UniRef50_Q7QF912e-5439.94%AGAP000365-PA n=3 Tax=Culicidae RepID=Q7QF91_ANOGA
NCBI RefSeqXP_310745.43e-5539.94%AGAP000365-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479636866e-5439.94%AGAP000365-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3800235323e-5335.80%PREDICTED: alpha-(1,3)-fucosyltransferase C-like [Apis florea]
Group
Gene OntologyGO:00160203.5e-70membrane
GO:00084173.5e-70fucosyltransferase activity
GO:00064863.5e-70protein glycosylation
KEGG pathwayapi:1001617266e-40 
 K00753 (E2.4.1.214)maps-> N-Glycan biosynthesis
InterPro domain[40-388] IPR0015033.5e-70Glycosyl transferase, family 10
Orthology groupMCL11198 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213757-TA
ATGATAACCTTCATTGTCTTTACACTGTCGATGATATGTATTCAATTCGTAAATAACAGCTTTATTGAATCTGATTCATTGGTCCAAGAAGTCATAGAAAATGTTGGAAGAGATTTAAGATACGCCGATATCTACAGGAGAGCTGACAAACTGTCGAAAAACCTTAAGTACATGTTGATTTGGTCTGGGGCAGAAGACGCCCCGTTATCATACTTTGGCGGTGGTCAAAGAAAATTTCTTGAGAAAAACTGTACCAATATAAATTGTTACGTCACAACAGATAGGAATTTCTTCAATGGGGATACGACGAAATTTCACGCGATCGCCTTTAATGGCCGTACTATTACAACGATGGGGAAGTCTCAGCTTCCGAAACGTCGATCACATCACCAAAAATTTATTTACTTCAATATGGAATCGGCTGATAATTACCCTTTATGTTCAGCTTATTTTGATGATTTCTTTAACTGGACAGCAACCTACAGATTGGATTCAGATATACCGCTTACTTACATTCAAATCAGGGATAATAATGGAACAGTCGTTGGACCAAAGAAAGATATGAAGTGGGTAGATATGGGTTTTCTTGAAGATGAAGAATTAGAATTGAGGATGCAGAACAAAACCAAAGCAGTTGCCTGGTTCGTGTCGCACTGTAAAACGAGAAGTAAGAGAAAAGACTATGCCATCCAATTGAAAAAAGCTTTGTATTCATTCGGTTTCTCTGTGGACATATATGGTAAATGTGGTCCTTTTAAGTGTCCAAGGCATAAGGAGGAAACATGTTTTTCACTTTTGGAAAGAGATTACGCTTTTTATCTCTCTTTCGAAAACTCTTTTGCCGAAGATTACGTAACGGAGAAGATACTTACTGCTTTACAGCATACAACCGTACCAATTGTGCGTGGTGGTGCCGATTACTCCAGATTTCTACCTCCTGGATCTTACGTCGACGCCACTAAGGTCACACCCAATGTTTTGGCTTCCGAGATTGTTCACATTATGATGAATACAAAGTACTACAGTCAGTTCTTCAGATGGTGGTCGCACTACAGTTACAGAGACCCGTCTCAATCGGATCATATTTGTGCATTGTGTGATGCACTTAACGATGAAAGCAAAAGAGCAAAAACTAGTACTTACAAAAACTTTCGGAAATGGTGGAACACTAACAATAGGTGTAAATAA

Protein sequence:

>DPOGS213757-PA
MITFIVFTLSMICIQFVNNSFIESDSLVQEVIENVGRDLRYADIYRRADKLSKNLKYMLIWSGAEDAPLSYFGGGQRKFLEKNCTNINCYVTTDRNFFNGDTTKFHAIAFNGRTITTMGKSQLPKRRSHHQKFIYFNMESADNYPLCSAYFDDFFNWTATYRLDSDIPLTYIQIRDNNGTVVGPKKDMKWVDMGFLEDEELELRMQNKTKAVAWFVSHCKTRSKRKDYAIQLKKALYSFGFSVDIYGKCGPFKCPRHKEETCFSLLERDYAFYLSFENSFAEDYVTEKILTALQHTTVPIVRGGADYSRFLPPGSYVDATKVTPNVLASEIVHIMMNTKYYSQFFRWWSHYSYRDPSQSDHICALCDALNDESKRAKTSTYKNFRKWWNTNNRCK-