Monarch geneset OGS2.0

DPOGS202369
TranscriptDPOGS202369-TA1884 bp
ProteinDPOGS202369-PA627 aa
Genomic positionDPSCF300104 + 137716-142265
RNAseq coverage126x (Rank: top 57%)
Annotation
HeliconiusHMEL0171676e-3158.16% 
BombyxBGIBMGA013791-TA4e-0847.06% 
Drosophila% 
EBI UniRef50UniRef50_UPI00019273F52e-2833.25%UPI00019273F5 related cluster n=2 Tax=unknown RepID=UPI00019273F5
NCBI RefSeqXP_002166732.13e-2933.25%PREDICTED: similar to YALI0B15400p [Hydra magnipapillata]
NCBI nr blastpgi|2211323776e-2833.25%PREDICTED: similar to YALI0B15400p [Hydra magnipapillata]
NCBI nr blastxgi|1951303012e-9342.21%GI15442 [Drosophila mojavensis]
Group
KEGG pathway 
Orthology groupMCL34440 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202369-TA
ATGAGTTTTATGTTTAAACCGGCGACTACAACTACAGCGAGCACTTTTGGCTCCAATACACCTTCAGGATCAATTTTCGGCGGTGGTGATACGAAAACTACTACAGAAAACAAGCCAAGTATTTTCGGCAGCCCTGCTAGTAGCACACCGGGCTTCGGCTTTTCAATCGGAGCAAAAACAACAGCCTCAACAAACTTATTCGCTTCCACCCCAACTACAAGTACAGCATTCGGTGCGAGCAAGCCCTTATCATTTGGAGCCTTCACATCACCCCCGGTCCAGACCGTTGGTACACCAGGATTCGGATCACCTCAAACAACTAATGCGGAACAGAAGCCAGCGTTCACGCCAACACCGGCGTTCGGGACTAACACGACACAGGTAATGTTCGTAAAATTTTTAAAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCACAACCAACATTCGGCCAAACCTCCCAAGCTCAATCAGCATTTGGCCAAACCTCTCAAGCTCAGCCAGCATTTGGCCAAACTTCTCAAGCCCAACCAGCGTTTGGCCAAACTTCTCAAGCCCAACCAGCGTTTGGCCAAACTTCTCAAGCCCAACCAGCGTTTGGCCAAACTTCTCAAGCCCAACCAGCGTTTGGCCAAACTTCTCAAGCCCAACCAGCGTTTGGCCAAACTTCTCAAGCCCAACCAGCGTTTGGCCAAACTTCTCAAGCCCAACCAGCGTTTGGCCAAACTTCTCAAGCCCAACCAGCGTTTGGCCAAACTTCTCAAGCCCAACCAGCGTTTGGCCAAACTTCTCAAGCCCAACCAGCATTTGGCCAAACATCCCAAAATCAATCAACTTTTGGTCAATCAACACAGCCGTCATTGTCTTTCGGACAAACCTCCCAAGCCGCTCCAACCTTTGGACAAACAACGCAGGCCCAAACAACACAATCATCAGCATTCGGTCAAACAACACAAGGCTTAG

Protein sequence:

>DPOGS202369-PA
MSFMFKPATTTTASTFGSNTPSGSIFGGGDTKTTTENKPSIFGSPASSTPGFGFSIGAKTTASTNLFASTPTTSTAFGASKPLSFGAFTSPPVQTVGTPGFGSPQTTNAEQKPAFTPTPAFGTNTTQVMFVKFLKHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKHNQHSAKPPKLNQHLAKPLKLSQHLAKLLKPNQRLAKLLKPNQRLAKLLKPNQRLAKLLKPNQRLAKLLKPNQRLAKLLKPNQRLAKLLKPNQRLAKLLKPNQRLAKLLKPNQRLAKLLKPNQHLAKHPKINQLLVNQHSRHCLSDKPPKPLQPLDKQRRPKQHNHQHSVKQHKA-