Monarch geneset OGS2.0

DPOGS204034
TranscriptDPOGS204034-TA2655 bp
ProteinDPOGS204034-PA884 aa
Genomic positionDPSCF300138 + 39412-47978
RNAseq coverage190x (Rank: top 48%)
Annotation
HeliconiusHMEL0066860.065.46% 
BombyxBGIBMGA004783-TA0.071.41% 
DrosophilaCG31108-PA2e-15543.28% 
EBI UniRef50UniRef50_D6WT000.049.93%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WT00_TRICA
NCBI RefSeqXP_970924.20.050.52%PREDICTED: similar to CG31108 CG31108-PA [Tribolium castaneum]
NCBI nr blastpgi|2700098770.049.93%hypothetical protein TcasGA2_TC009196 [Tribolium castaneum]
NCBI nr blastxgi|2700098772e-17650.07%hypothetical protein TcasGA2_TC009196 [Tribolium castaneum]
Group
Gene OntologyGO:00064646.7e-209protein modification process
GO:00048356.7e-209tubulin-tyrosine ligase activity
KEGG pathway 
InterPro domain[228-719] IPR0043446.7e-209Tubulin-tyrosine ligase
Orthology groupMCL11636 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204034-TA
ATGAGAAGACATGATGATGAAACCCACAACGGGAAAGAGAAAGAGAATGAAGGATCTATATCAGATTGGGTTTCCGGTGGACCGATCGGGTCCAAAGGAGCCGTGCTCGTGTTCCGATCCTGCGTCCTCGCGACCAGGACGCCATCTGTTGAGTCGACAACAAAAACGGTTGGCAATAACACCGGTACCACGATACTTTTGAACAAGGCCAAGAATATGGCTAGAAGGTTGGTAGCAGAGCCGCACGCGTTCAGTGTCCGGCCCAAGAACATCATGCACGATGTGATGGAAACTTTCGGAAATAGCAACAAATACACGGGACAAGCGTTCGTAGTGAGGCACGTGAGTGAAGCTGAGGTGCCAGAGACCATGTCCGTAGATGATCAGGGCCACGAACGAACCGAACACAAGAGAGACATCGATGACGTCCTCAATGATTTAAATAAAATTACGCTTAAGAAGAATCTCAATATAGAAAATACAACGGCCAATATAAAACCTCCCGACGATCCCATGACTGTTAATAAGGATAAAGAAGATAAGAAAGGCAAGAAGAGCAAAGGCAAGAACAAACGCAAGAACACGCACAACCAAAGACCAGTGTCACCGGCGTTGGACAGCGATGACTCGGAACACATGTTGGTTAAAGAAGTAGACAGTGTTAAGGTGTTAGATAGTAATAATATGCCAGCAAGGATGCTCAAGATTACATACAGACTGGTCAACACGGAGACCAAGCTGTTACATAGACTTTTACAAGCTCACGGTCTTCAGGAAGCGTCCTCGGAAGCAAAGGATTTCAATCTGATGTGGGCGGGGCTGCATCCCAAGCCGGACGTGTTGAGGTCGCTCACACCCTACCAACGCGTTAATCATTTTCCTAGGGTTAATTTGGTCAAGAAGTTATTAGACGGACACGGTCTGTTAGAAGCTCACCCTGAATCAGCCACTTGGACCTTGTTCTGGTCTACGAATCTTACTCCCTTTGAAGTGTTAAGATCCCTCACAACGATGCAAAAAGTCAATCACTTACCCAGGTCCTACGAACTCACTCGGAAAGACAAACTGTTTAAGAACATAGAAAAGATGCAGTACTTCCGTGGACTGAAACACTTTGACTTCATCCCCACCACCTTCCTGATGCCCAACGAGTACAAAGAGCTGTGCACGACACACTACAGGACCAAGGGGCCGTGGATAGTCAAGCCGGCCGCGTCCAGCAGAGGGCGGGGCATTTATATCGTTAATACGCCCGAACAAATACCGAAGGGAGAAAATGTCGTCGTTGCCAAATATATAGACAAGCCGCTCCTCATCGGCGGTCACAAGTGCGACCTCAGACTCTACGTGTGCGTCACCTCCATTGACCCCTTACTGATATATCTGTACGAAGAAGGCTTGGTGAGATTCGCGACCGTCAAATACGATAAGACGAACAAGAACCTGTGGAACCCCTGCATGCATCTGTGTAACTACAGCATCAACAAGTACCACACGGATTACATCAAGTGCGACGACCCCAACGCCGGCAACATCGGTCACAAGTGGACGCTATCAGCTTTACTGCGACATCTCAGGAAGCAGGGGCGGAACACGTCGGCGTTGATGGCGGCCATTGAAGATCTGGTGGTGAAGTCTATCTTATCATCAGCACAGACCATCACCGCGGCTGCCAGAGTGTTCGTTCCCAGCCTCTTTAACTGTTTCGAACTCTTCGGATACGACATCCTCGTAGACGACATGCTGAAGCCCTGGCTGCTGGAGATAAATCTGTCACCGAGCCTGGCGTGTGAGAGTCCTTTGGATGCGAGGGTGAAGTCCGCCCTTCTAGCGGACACCCTGACGCTGGTGGGTCTGCCGGCTGTGCCCGGCAACAGACAGGAACAGCTGGGTCCTCAAGCGACCTCCCTGAAGATGAGGATCGGAGCTTGTCGCCGCGTTCACTCCGCTGAGAACGCATGCGTGAGGAACAAGAGTGGGGGGCCGGGGGCGGGAGCGGGCTCGGGGGTGCCCGGCGCCGGGCCTCTGGGAAACCTCCTCACGGGCGAGGAGCTGAGGATAGTCCGAGCCGTGAGGGCGCAGTACGCCAGGAGGGGGGGATTCGTTAGGATCTTCCCTTCGCAGAACAGCTGGCAGAAGTACTCACAGTATTTAGACCCGGTGACCGGCATCCCGGTCTGCTCGACGTCTCTCAACAACAACACGCCGTATACCGTCATACATAACTACAATCTGTTGGTCCACTCGCACGTGCTGCCGCACTTCCAGCACGCGACCGTCGCCACCAGCCTCACAGACACACCGGAGAGATTGAAGCGATATGAGGCAGCCTGTATAGCGGGTGCTGCGCCCGCCGTTCAGCTCTCAGAGCCGGCGACCCCGGCGCCCCGGGACGCCAGGAGGGCTAAGGACCTGGTCAAGAGACAGCTGGCCGAAGGAATGACGCTCACTCGAGGCGAGGTCCGTCGTGCGTTCGGTATGTACCTGACGCACGTCCTGAAGCGCATCTCGTCCCCCTGCACGGAGCAGGGCGCCCAACACGCCGCGCTGGTGCTGCGCTTCCTCCGCCGAGCCTCCGCTTCACTCCGCATGCCATATAATGTAAAGCCCGAGCGGCGGGCACCTGCCGCCCGTCGCTCGGAGTGTTACATCTAA

Protein sequence:

>DPOGS204034-PA
MRRHDDETHNGKEKENEGSISDWVSGGPIGSKGAVLVFRSCVLATRTPSVESTTKTVGNNTGTTILLNKAKNMARRLVAEPHAFSVRPKNIMHDVMETFGNSNKYTGQAFVVRHVSEAEVPETMSVDDQGHERTEHKRDIDDVLNDLNKITLKKNLNIENTTANIKPPDDPMTVNKDKEDKKGKKSKGKNKRKNTHNQRPVSPALDSDDSEHMLVKEVDSVKVLDSNNMPARMLKITYRLVNTETKLLHRLLQAHGLQEASSEAKDFNLMWAGLHPKPDVLRSLTPYQRVNHFPRVNLVKKLLDGHGLLEAHPESATWTLFWSTNLTPFEVLRSLTTMQKVNHLPRSYELTRKDKLFKNIEKMQYFRGLKHFDFIPTTFLMPNEYKELCTTHYRTKGPWIVKPAASSRGRGIYIVNTPEQIPKGENVVVAKYIDKPLLIGGHKCDLRLYVCVTSIDPLLIYLYEEGLVRFATVKYDKTNKNLWNPCMHLCNYSINKYHTDYIKCDDPNAGNIGHKWTLSALLRHLRKQGRNTSALMAAIEDLVVKSILSSAQTITAAARVFVPSLFNCFELFGYDILVDDMLKPWLLEINLSPSLACESPLDARVKSALLADTLTLVGLPAVPGNRQEQLGPQATSLKMRIGACRRVHSAENACVRNKSGGPGAGAGSGVPGAGPLGNLLTGEELRIVRAVRAQYARRGGFVRIFPSQNSWQKYSQYLDPVTGIPVCSTSLNNNTPYTVIHNYNLLVHSHVLPHFQHATVATSLTDTPERLKRYEAACIAGAAPAVQLSEPATPAPRDARRAKDLVKRQLAEGMTLTRGEVRRAFGMYLTHVLKRISSPCTEQGAQHAALVLRFLRRASASLRMPYNVKPERRAPAARRSECYI-