Monarch geneset OGS2.0

DPOGS212027
Transcript: DPOGS212027-TA, 2418 bp
Protein: DPOGS212027-PA, 805 aa
Genomic position: DPSCF300054 - 481025-496854
RNAseq coverage: 528x (Rank: top 24%)
Annotation
Heliconius: HMEL012972, 0.0, 66.35%
Bombyx: BGIBMGA010181-TA, 2e-98, 60.65%
Drosophila: CG16833-PD, 6e-45, 48.04%
EBI UniRef50: UniRef50_Q7PVY2, 3e-51, 59.26%, AGAP009150-PA n=4 Tax=Culicidae RepID=Q7PVY2_ANOGA
NCBI RefSeq: XP_001654205.1, 6e-53, 59.88%, hypothetical protein AaeL_AAEL001898 [Aedes aegypti]
NCBI nr blastp: gi|157125003, 1e-51, 59.88%, hypothetical protein AaeL_AAEL001898 [Aedes aegypti]
NCBI nr blastx: gi|157125003, 6e-54, 28.32%, hypothetical protein AaeL_AAEL001898 [Aedes aegypti]
Group
Gene Ontology: GO:0006464, 6.6e-62, protein modification process
               GO:0004835, 6.6e-62, tubulin-tyrosine ligase activity
KEGG pathway:
InterPro domain: [289-594] IPR004344, 6.6e-62, Tubulin-tyrosine ligase
Orthology group: MCL25708, Lepidoptera specific

Nucleotide sequence:

>DPOGS212027-TA
ATGGATGGCATGAGAACGTGCCCAAGCTACGAGGCCGGGTTCGAGGCGTGCGATCCTTTGCGGTCAGTTTTTGAGTGGTCAGGGCGAGGACCGGCGCGTGTGCTGAGACGGCGCTGCCGCAGCGAAGGTGTCGATCGGTCTCCATATGAGGATGCGACGAGGGAGTTACATAGACCCGAATATCGCCAGTGCCGTGCATTAAATAAAAGCCACTTTACCTTGAACAACAGTTACGAGCGATTTGAGAGGTTCGACCACGAGTGCTCACTGCCCGCTGAGAACGGCGACTACGTTATGGCACACGAAAGTAACTTACGCCATGGTTACCAGCCAGGAAGAACGAAATATCAAAGAACTAGTCCCATTCGTCCATTGTTAGTCACCACATTATCAAACAATAAGCTAGATGACAGCCGTCTACCTAAAGTTGAGGAATTGCTTGAGAAAGTGGACCAGGAAATGATAGAGGTTCCTATGACCAAACAGCCTTTAAAGTCAATACTCAATAATCCTTTGCCTAAGACACAGCCGCAAAGTATATTAAAGAAACCTAAAAATAGATCATATCAGGAATCCAAATCTGCATTAGAAGGACGCAGGAGACAGAGAGCATCCAGTGAATCTGATAATTTTTATTGTAGTTTTCATCTCGGTTCATCCTTACCGGGATATGATGGTCACCTTTCATATAGATCTAGTAACATGAAGCGGGCTAAAGATAATTCTGGTACTGGATATTCATATGGAACTGCTATAGCCGCTCTCGGTAATACCCCTAGGGCAAAATCTCCAACCAACTATCCTAAGAGCAGATGCAATTGTCAAAACCAAGTCATAGTTACACCTAAAGATGAAGCATACAGAGTACATTTGTTAAGGACGTCCGCAATAAACAGTGAAACTGTCAATGATGTCCCTGTTGTTGATATGCCTCGTCAAGAGACGCATACTCCTACGAGACCGGCTCACAGCCCCGCGCTGCCAGTGAGTTCTGTGACACCAACCACCGTCACCAAGAAGAAGAAAAAAAAGAAAACTAAGACGAAGACCACGATCAGCACTCAAGAGGAAGCTGATGATTCTCACAGCCGCTCGCCATCCCCGGATCCGACAGTGGACACCTGGAAAATGGAAACCGAAACGGAAGTAGACAAGACGGACAAACAGCTGCATAATGACGCAGACAAAAACCCCGTGGTGAAGCTGGCGACTAAACTCAACGGCACCCTCGTGTCTCCCAAGAAGGCTGCGTCTAAAGAATCACCCCTCCACCTCCCCGAGTCGATGTCTAACTGTCTCCGGCCGTCTCTGTTCCCGCGAGTGCCTCCTTACCTGAAGTTTATAAGTCACGATGACACCGCACCACTCAAAATGCCGATGGCCATACAGAAACACCTCAAGTGGAAGTTGACCACAATCACACCGATAGTTGTCAAGAAGACGTTAACCAACTCGGGGTTCAGATTGGTCAAGAGCGAGTGCGACACATCCGAGTGCCCCCAAGAAGAAACCGTGGATTGGATCGGTATATGGGGAAAACACATGAAATCTATCATGTTCCGCGCCATCAAAGACGGGCAGAAGATGAACCACTTCCCAGGAACCTTCCAGATAGGACGCAAGGACCGGTTGTGGAGGAACCTACAGAAGTTGGCGTCTAGACACGGGGTCAGCGAGTTCGGCATCATGCCCAAAACATACGTCCTGCCTCACGATCTGAAAATACTGAAACACGACTGGGAGAAGTACGCCGCCAACAACGAGAAGTGGATCATAAAGCCGAATGGTTTGCCGTACGACAGTAGACTGTACACGGTGTATCTGTCGAAGGAAGAACGCGATAAACATATTATATACACACACATGGAAGACAGAGATCTGTATCTAAGGGACATCTTATCAACCTTGACCCCCGACGATGTCCGACACCTCATACAAGCTGAAGACGAACTGACGCAGATAGACTCTATGGAGCGAGTGTTCCCTACAAAGAACACGCA
TAAATACCTGACCTTCCTGGTAGGGCCACGGTACTATAACAGACTGTTTGACGCCTGGGAGAGCAGATACTGCGACTATAGAGAACCAGGCATCGAGCTGTTGCGTAATCTCTGCGACATCGGCTACCACCTGGAGGTGCCCCCCGTACCGTTAAAGGTAACCCCCTCTCCCGACCCCTCGCCCCGCATCTGCGACATCGGCTACCACCTGGAGGTGCCCCCCGTACCGTTAAAGGTAACCCCCCCTCCCGACCCCTCGCCCCGCAGCCCCCACCCCTCACCCCCGCCGAGGACCTCTCCTGCACGCTCCCCGCCCGTCAGTCGCCCCGCTCTCGCCCTAGCGTTAGCACGCGCCGCCCGTCACCTCGCACGCGCTGCTGTCACGCTCGTGTTGTATGTCATGTTGATGCGTCTCTGA

Protein sequence:

>DPOGS212027-PA
MDGMRTCPSYEAGFEACDPLRSVFEWSGRGPARVLRRRCRSEGVDRSPYEDATRELHRPEYRQCRALNKSHFTLNNSYERFERFDHECSLPAENGDYVMAHESNLRHGYQPGRTKYQRTSPIRPLLVTTLSNNKLDDSRLPKVEELLEKVDQEMIEVPMTKQPLKSILNNPLPKTQPQSILKKPKNRSYQESKSALEGRRRQRASSESDNFYCSFHLGSSLPGYDGHLSYRSSNMKRAKDNSGTGYSYGTAIAALGNTPRAKSPTNYPKSRCNCQNQVIVTPKDEAYRVHLLRTSAINSETVNDVPVVDMPRQETHTPTRPAHSPALPVSSVTPTTVTKKKKKKKTKTKTTISTQEEADDSHSRSPSPDPTVDTWKMETETEVDKTDKQLHNDADKNPVVKLATKLNGTLVSPKKAASKESPLHLPESMSNCLRPSLFPRVPPYLKFISHDDTAPLKMPMAIQKHLKWKLTTITPIVVKKTLTNSGFRLVKSECDTSECPQEETVDWIGIWGKHMKSIMFRAIKDGQKMNHFPGTFQIGRKDRLWRNLQKLASRHGVSEFGIMPKTYVLPHDLKILKHDWEKYAANNEKWIIKPNGLPYDSRLYTVYLSKEERDKHIIYTHMEDRDLYLRDILSTLTPDDVRHLIQAEDELTQIDSMERVFPTKNTHKYLTFLVGPRYYNRLFDAWESRYCDYREPGIELLRNLCDIGYHLEVPPVPLKVTPSPDPSPRICDIGYHLEVPPVPLKVTPPPDPSPRSPHPSPPPRTSPARSPPVSRPALALALARAARHLARAAVTLVLYVMLMRL-
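
Length check:

The listed lengths (2418 bp transcript, 805 aa protein) look consistent with a transcript that is coding sequence only, ending in a stop codon. A minimal Python sketch of that arithmetic follows. The function name `expected_cds_length` is hypothetical and not part of any database tool, and the absence of untranslated regions is an assumption inferred from the lengths, not something the record states.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Lengths as listed in the record (assumed to describe the same gene model).
TRANSCRIPT_BP = 2418
PROTEIN_AA = 805

# True when the transcript length equals 3 * (residues + stop codon).
print(expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
```

If the check fails for another record, the transcript may include untranslated regions or the protein may be partial; the sketch cannot tell those cases apart.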