Monarch geneset OGS2.0

DPOGS205291
TranscriptDPOGS205291-TA3165 bp
ProteinDPOGS205291-PA1054 aa
Genomic positionDPSCF300021 + 469838-480781
RNAseq coverage637x (Rank: top 20%)
Annotation
HeliconiusHMEL0174800.098.67% 
BombyxBGIBMGA011033-TA0.096.69% 
Drosophilasxc-PC0.085.78% 
EBI UniRef50UniRef50_E2C6I40.080.09%UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit n=19 Tax=Arthropoda RepID=E2C6I4_HARSA
NCBI RefSeqXP_967579.20.090.75%PREDICTED: similar to AGAP006254-PA [Tribolium castaneum]
NCBI nr blastpgi|2700045550.089.23%hypothetical protein TcasGA2_TC003916 [Tribolium castaneum]
NCBI nr blastxgi|3800197490.088.54%PREDICTED: UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit-like isoform 2 [Apis florea]
Group
Gene OntologyGO:00054886.1e-75binding
GO:00055159.3e-09protein binding
KEGG pathway 
InterPro domain[273-522] IPR0119906.1e-75Tetratricopeptide-like helical
[108-141] IPR0014409.3e-09Tetratricopeptide TPR-1
[108-141] IPR0197344e-08Tetratricopeptide repeat
Orthology groupMCL11196 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205291-TA
ATGCAACCTCAAGCGAATGTTGCCGTGCCTCAATCTGTCACGACGCAGCCTCAACAAATTGTCAGCGTTCCTGCAAATGCTGTGATCTTGAAAATGTCGGAAATTCAACAGATATCTACAGTGGGACTCCTGGAGCTTGCACACCGGGAATATCAAGCTGGAGACTATGATAGTGCGGAACTGCATTGTATGCAGCTATGGCGTCAAGATGGCACAAATACGGGTGTTCTTTTGCTGCTGTCCTCCATACATTTTCAATGTCGGCGTTTAGACAAATCAGCACATTTTTCAACGCTTGCTATAAAACAGAATCCTCTCCTGGCGGAGGCGTACAGTAATCTCGGAAATGTATACAAGGAGCGTGGGCAGTTGCAAGAGGCTTTGGAAAACTATCGTCACGCTGTCCGTCTAAAGCCAGATTTCATTGATGGGTACATCAACTTGGCAGCTGCCTTGGTGGCTGCAGGAGACATGGAACAGGCTGTACAGGCTTATGTTACAGCATTGCAGTATAATCCTGAACTTTACTGCGTTAGAAGTGACCTGGGCAATTTGCTCAAGGCCCTTGGACGTTTGGACGAAGCGAAGGCTTGTTACTTGAAGGCCATCGAAACGAGGCCAGACTTTGCAGTGGCATGGAGTAACCTAGGATGCGTTTTTAACGCACAAAGTGAAATCTGGTTGGCCATACATCATTTTGAAAAGGCCGTGGCATTGGATCCGAATTTCTTGGATGCTTATATCAATCTAGGAAATGTTCTCAAAGAAGCGAGAATTTTTGACAGGGCGGTGGCTGCATATTTACGAGCTCTTAATTTATCGCCGAACAATGCAGTTGTTCATGGTAATTTAGCGTGCGTGTATTATGAACAAGGACTTATTGATTTAGCGATCGACACTTATCGGCGAGCTATAGAACTTCAACCGAATTTCCCAGATGCCTACTGTAATTTGGCTAATGCATTAAAGGAAAAGGGTCAAGTGGTTGATGCGGAGGAATGTTATAATACTGCTCTAAGGTTGTGCCCATCACACGCTGATTCATTAAATAACTTAGCGAACATCAAACGCGAGCAAGGATACATAGAGGAAGCGACTCGTTTATATTTAAAAGCTTTGGAAGTATTTCCCGAGTTTGCAGCAGCTCATAGTAACTTGGCGTCAGTTTTGCAACAACAAGGCAAACTAAACGAAGCACTCATGCATTATAAAGAGGCTATACGTATACAGCCAACGTTTGCTGATGCTTATAGTAATATGGGCAATACTCTTAAAGAAATGCAAGACGTCGCTGGAGCATTGCAGTGTTATACCCGAGCTATACAAATTAATCCAGCGTTTGCCGATGCTCATAGCAATCTTGCCAGTATCCACAAAGATTCGGGAAATATACCGGAAGCTATACAGTCCTATAGAACAGCGTTGAAGTTGAAACCGGACTTCCCTGACGCGTATTGTAACTTGGCGCACTGTTTGCAAATCGTTTGCGATTGGACCGACTACGAGGCCCGTATGAAGAAATTAGTCAGTATTGTGGCAGAACAGCTTGAAAAGAATAGACTACCCTCGGTTCATCCTCATCATTCTATGCTTTACCCATTGACGCATGAATTCAGAAAGGCTATTGCGGCCCGACATGCGAATTTATGTCTGGAGAAGGTTCAAGTTCTCCACAAGCCGGCTTACAAATTTCCAAGAGAGCTGCAAAGCCGCCTGCGTATCGGTTATGTAAGCAGTGATTTTGGCAATCACCCAACATCACATTTGATGCAATCTGTGCCCGGATTACACGATCGTACTAAGGTCGAGATCTTTTGTTACGCTCTTAGTCCAGATGATGGTACAACATTCCGTTCTAAAATAGCTAGAGAAGCCGAGCACTTTATTGATCTATCACAGATTCCATGCAACGGCAAAGCTGCCGATAAAATATATTCTGATGGTATTAATATTCTGGTAAACATGAACGGATACACAAAGGGTGCCAGGAATGAAATATTTGCTCTACGTCCGGCTCCTGTGCAAGTAATGTGGCTCGGATATCCAGGCACAAGTGGTGCAAGTTATATGGACTACTTAGTAACTGATGCTGTAACATCTCCAGTCGAATTGGCAAGTCAGTACAGCGAGAAGCTCGCATACATGCCTCATACATATTTCGTCGGCGACCACAAGCAGATGTTCCCCCACTTACAGGAGAGATTGATAGTTAGTGACAAAATCAAATCCCATAATAACATGGGCAGTCTAGCTGATAATGTCGCCGTCATTAATGCAACTGATTTGTCTCCACTTGTCGAAAACACTGATATCAAAGAAATTAAAGAAGTTGTAAGAGCAGCGAGGCCGGTTGAAATATCATTGAAGGTCGCAGAGTTACCTACTACTACGCCTATAGAAAACATGATTGCTTCGGGACAAGTACAGACATCTGTAAATGGTGTCATCCTTCAAAACGGTCTGGCCACAACACAAACAAACAACAAAGCGGCTACAGGAGAGGAAGTGCCACAGTCTATTGTAATCACAACAAGACAACAGTACGGTCTACCGGATGATGCAGTGGTCTACTGTAATTTCAATCAACTGTATAAGATAGATCCGCTAACTCTACACATGTGGGTATACATATTGAAACACGTCCCTAACAGCGTGTTGTGGCTTTTGAGATTCCCGGCTGTCGGTGAACCTAATTTACAAGCAACGGCGCAGCAGTTGGGATTACCTCCCGGCCGTATAATCTTCTCAAACGTGGCTGCTAAAGAGGAGCACGTGAGGCGCGGTCAACTGGCGGACGTATGTCTAGACACACCCTTATGTAACGGTCACACTACTAGTATGGATATTTTGTGGACAGGCACCCCCGTTGTTACATTACCAGGAGAGACATTAGCCTCACGGGTGGCTGCATCACAACTCAATACACTTGGTTGTCCTGAACTGATTGCGAGAACGAGACAGGAATATCAAGACATAGCTGTACGATTAGGAACGGACAGGGAATATCTTAAAGCAATCCGAGTGAAAGTATGGACAGCTCGCACGGAGAGTCCACTATTCGACTGCAAAGCATACGCCACCGGTTTGGAGATGTTGTACAACAAAATGTGGTCGAGGTACGCTCGCAACGAGCGACCCGACCACATACAGGCCATAGACAAATAG

Protein sequence:

>DPOGS205291-PA
MQPQANVAVPQSVTTQPQQIVSVPANAVILKMSEIQQISTVGLLELAHREYQAGDYDSAELHCMQLWRQDGTNTGVLLLLSSIHFQCRRLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPELYCVRSDLGNLLKALGRLDEAKACYLKAIETRPDFAVAWSNLGCVFNAQSEIWLAIHHFEKAVALDPNFLDAYINLGNVLKEARIFDRAVAAYLRALNLSPNNAVVHGNLACVYYEQGLIDLAIDTYRRAIELQPNFPDAYCNLANALKEKGQVVDAEECYNTALRLCPSHADSLNNLANIKREQGYIEEATRLYLKALEVFPEFAAAHSNLASVLQQQGKLNEALMHYKEAIRIQPTFADAYSNMGNTLKEMQDVAGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLEKNRLPSVHPHHSMLYPLTHEFRKAIAARHANLCLEKVQVLHKPAYKFPRELQSRLRIGYVSSDFGNHPTSHLMQSVPGLHDRTKVEIFCYALSPDDGTTFRSKIAREAEHFIDLSQIPCNGKAADKIYSDGINILVNMNGYTKGARNEIFALRPAPVQVMWLGYPGTSGASYMDYLVTDAVTSPVELASQYSEKLAYMPHTYFVGDHKQMFPHLQERLIVSDKIKSHNNMGSLADNVAVINATDLSPLVENTDIKEIKEVVRAARPVEISLKVAELPTTTPIENMIASGQVQTSVNGVILQNGLATTQTNNKAATGEEVPQSIVITTRQQYGLPDDAVVYCNFNQLYKIDPLTLHMWVYILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLPPGRIIFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDILWTGTPVVTLPGETLASRVAASQLNTLGCPELIARTRQEYQDIAVRLGTDREYLKAIRVKVWTARTESPLFDCKAYATGLEMLYNKMWSRYARNERPDHIQAIDK-