Monarch geneset OGS2.0

DPOGS212583
Transcript: DPOGS212583-TA, 1479 bp
Protein: DPOGS212583-PA, 492 aa
Genomic position: DPSCF300075 + 524473-526708
RNAseq coverage: 6x (Rank: top 87%)
Annotation
Heliconius: (none listed)
Bombyx: BGIBMGA009435-TA, E-value 9e-12, identity 36.08%
Drosophila: (none listed)
EBI UniRef50: UniRef50_B5W8Y7, E-value 4e-16, identity 26.78%, TPR repeat-containing protein n=7 Tax=Oscillatoriales RepID=B5W8Y7_SPIMA
NCBI RefSeq: (none listed)
NCBI nr blastp: gi|209527924, E-value 1e-15, identity 26.78%, TPR repeat-containing protein [Arthrospira maxima CS-328]
NCBI nr blastx: gi|296478893, E-value 6e-19, identity 31.92%, proteoglycan 4 [Bos taurus]
Group
KEGG pathway 
Orthology group: MCL23319 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212583-TA
ATGTTGGCTTCGGCGGAATATGATAAACAGAGTTTTGAAAAAAGTAAGGGTAAGCTACGTTCATTTAAAAGAGGTGACTATGCACTAATCAAAACTAATCCTCGTAAACAAACTTCTTTGGATCTGAAAAATACTGAACCATACGAAATATACAAAATATTGGAACGTGATCGTTACATGCTAAAACGTGTAACCGGTAGAGGCCGGCCGCGTAAGTTAGCTCATGATCAATTACGTCCAGCACCAAATCCAGCAGCAGCAGGAACCGTGTCGGCGGATCAGATTGATGATCCTCCACATTACGACAGTCCATTAAATGTTGAAGCTTCTGAAAATTTAGAAGTTGAATCCAATGAACGATCAATAAACTTGGTCCAACATTCATTGCTTAGTAAAAGCCTCTGTGGACCACGGACAAAAGTCCTAGACCACAGCATCGACCTACCTCAATCCCTTAGAGCCCCTGTGGACCACGGCGTACGAGACCGCCCAGACCACAGCAACATCATAAAATCAATATCCTCTGTGGACCACGGACAAAAGTCCCAGACCACAGCATCGACCTACCTCAATCCCTTAGAGCCCCTGTGGACCACGGCAGCCCCTGTGGACCACGGCGTACGAGACCGCCCAGACCACAGCAACATCATAAAATCAATATCCTCTGTGGACCACGGACAAAAGTCCCAGACCACAGCATCGACCTACCTCAATCCCTTAGAGCCCCTGTGGACCACGGCAGCCCCTGTGGACCACGGCGTACGAGACCGCCCAGACCACAGCAACATCATAAAATCAATATCCTCTGTGGACCACGGACAAAAGTCCCAGACCACAGCATCGACCTACCTCAATCCCTTAGAGCCCCTGTGGACCACGGCAGCCCCTGTGGACCACGGCGTACGAGACCGCCCAGACCACAGCAACATCATAAAATCAATATCCTCTGTGGACCACGGACAAAAGTCCCAGACCACAGCATCGACCTACCTCAATCCCTTAGAGCCCCTGTGGACCACGGCAGCCCCTGTGGACCACGGCGTACGAGACCGCCCAGACCACAGCAACATCATAAAATCAATATCCTCTGTGGACCACGGACAAAAGTCCCAGACCACAGCATCGACCTACCTCAATCCCTTAGAGCCCCTGTGGACCACGGCAGCCCCTGTGGACCACGGCGTACGAGACCGCCCAGACCACAGCAACATCATAAAATCAATATCCTCTGTGGACCACGGACAAAAGTCCCAGACCACAGCATCGACCTACCTCAATCCCTTAGAGCCCCTGTGGACCACGGCAGCCCCTGTGGACCACGGCGTACGAGACCGCCCAGACCACAGCAACATCATAAAATCAATATCCTCTGTGGACCACGGACAAAAGTCCCAGACCACAGCATCGACCTACCTCAATCCCTTAGAGCCCCTGTGGACCACGGCGTACGAGACCGCCCAGACCACAGCAACATCATAG

Protein sequence:

>DPOGS212583-PA
MLASAEYDKQSFEKSKGKLRSFKRGDYALIKTNPRKQTSLDLKNTEPYEIYKILERDRYMLKRVTGRGRPRKLAHDQLRPAPNPAAAGTVSADQIDDPPHYDSPLNVEASENLEVESNERSINLVQHSLLSKSLCGPRTKVLDHSIDLPQSLRAPVDHGVRDRPDHSNIIKSISSVDHGQKSQTTASTYLNPLEPLWTTAAPVDHGVRDRPDHSNIIKSISSVDHGQKSQTTASTYLNPLEPLWTTAAPVDHGVRDRPDHSNIIKSISSVDHGQKSQTTASTYLNPLEPLWTTAAPVDHGVRDRPDHSNIIKSISSVDHGQKSQTTASTYLNPLEPLWTTAAPVDHGVRDRPDHSNIIKSISSVDHGQKSQTTASTYLNPLEPLWTTAAPVDHGVRDRPDHSNIIKSISSVDHGQKSQTTASTYLNPLEPLWTTAAPVDHGVRDRPDHSNIIKSISSVDHGQKSQTTASTYLNPLEPLWTTAYETAQTTATS-
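
The lengths reported in this record can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, not part of the original record: the function names are hypothetical, and it assumes that the transcript is coding sequence only (no UTRs), that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1479
PROTEIN_AA = 492
GENOMIC_START, GENOMIC_END = 524473, 526708


def expected_cds_length(protein_aa, include_stop=True):
    """Coding-sequence length in bp implied by a protein length in aa."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def span_length(start, end):
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


cds_bp = expected_cds_length(PROTEIN_AA)
locus_bp = span_length(GENOMIC_START, GENOMIC_END)

print(cds_bp == TRANSCRIPT_BP)            # True: (492 + 1) * 3 = 1479
print(locus_bp, locus_bp - TRANSCRIPT_BP)  # 2236 757
```

Under these assumptions the transcript length matches the protein length plus one stop codon, and the locus spans 2236 bp, which would leave 757 bp of the genomic interval outside the transcript.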