Monarch geneset OGS2.0

DPOGS207969
TranscriptDPOGS207969-TA1233 bp
ProteinDPOGS207969-PA410 aa
Genomic positionDPSCF300090 + 542506-547876
RNAseq coverage409x (Rank: top 30%)
Annotation
HeliconiusHMEL0070660.092.27% 
BombyxBGIBMGA000321-TA4e-16582.95% 
DrosophilaCG6664-PD4e-13766.92% 
EBI UniRef50UniRef50_Q2PDZ35e-13566.92%CG6664, isoform D n=34 Tax=Coelomata RepID=Q2PDZ3_DROME
NCBI RefSeqXP_969333.18e-14272.64%PREDICTED: similar to CG6664 CG6664-PA [Tribolium castaneum]
NCBI nr blastpgi|3072144652e-14166.28%Coiled-coil domain-containing protein 6 [Harpegnathos saltator]
NCBI nr blastxgi|3320261472e-15269.75%Coiled-coil domain-containing protein 6 [Acromyrmex echinatior]
Group
KEGG pathwaytca:6578052e-141 
 K09288 (CCDC6, PTC)maps-> Pathways in cancer
    Thyroid cancer
InterPro domain[12-339] IPR0191523.1e-194Protein of unknown function DUF2046
Orthology groupMCL14332 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207969-TA
ATGGATGATACAGTAACTTCTCCAGGTAATAAAATGGGTGACTCGGCATCAGAGAGTGACTCGAGTTCGCTCGATGGCGGAGCTATGTTGCCTCCAAGCACCGTTTCTCGGGATCAGTTGCAGAAGAGAATTGAATCATTGCAACAACAAAATCGAGTTTTAAAAGTTGAACTTGACACCTATAAATTGAGGGTGAAGGCTTTGCAGGAAGAAAATCGTGCACTGCGACAGGCATCTGTGTCAATTCAAGCAAAGGCTGAACAAGAAGAGGAATATATTTCTAACACTCTGCTCAAGAAGATACAGGCCTTAAAGAAAGAAAAGGAAACTTTGGCTCATCACTATGAAAGGGAGGAAGAGTGTCTCACAAATGATCTATCTCGTAAATTGAACCAATTGAGACAAGAAAAGTGTCGCCTTGAGCAGACATTAGAGCAGGAACAAGAATGTTTGGTTAATAAATTAATGAGAAAAATAGAAAAATTGGAAGCGGAGACATTTGCTAAGCAGACTAACTTAGAAAGACTCCGCAGGGAGAAGGTAGAATTAGAAAATACCTTAGAACAGGAACAAGAGGCGCTAGTAAATAGGCTTTGGAAACGAATGGATAAGTTGGAAGCTGAGAAACGTTCGCTACAAATAAGATTAGATCAGCCAGTTTCAGATCCAGCCAGTCCTAGGGACATCAGTAATGGAGATACAGCATCTAATTTAAGTAGTCATATTCAAACTCTGAGATCAGAAGTTGTCAAATTGAGGAACCAACTGGCCTTTTCCCAGAATGAAAGTAAAGAAAAAATGCAACGTTTTGCACTAGAGGAGAAACATATCCGAGAAGAGAATGTACGATTACATAGGAAATTACAACAAGAGGTCGAACGTCGCGAAGCTTTGTGCCGACATCTTTCTGAAAGTGAATCTTCTTTGGAGATGGAAGAGGAACGCCAGTTTAATGAGGCTCTAAATGCGCGATCTAGGAGCGTGTCGTCTCCGGGCGGGTCGCGCCCACTGTCTCCATACGCTCTGCACAACCCAGCTAGACCGGCGCTGCACTTTAATTCACAGCAAGCACGTCGTGCTAGCGAGCGTTTCGTGAAGCCAGCTGTCCCCGGTGCGGGTGTGTCGTTGCCGCCCCGAGTTCCTCTAGAGTCGGCCTCCGCCCCCGCCCCTCCGGCGCCCCCCTCGCCCTCGATGCAACCAGCCAGCCCCATGGACACATCCAGCAGAGAATGA

Protein sequence:

>DPOGS207969-PA
MDDTVTSPGNKMGDSASESDSSSLDGGAMLPPSTVSRDQLQKRIESLQQQNRVLKVELDTYKLRVKALQEENRALRQASVSIQAKAEQEEEYISNTLLKKIQALKKEKETLAHHYEREEECLTNDLSRKLNQLRQEKCRLEQTLEQEQECLVNKLMRKIEKLEAETFAKQTNLERLRREKVELENTLEQEQEALVNRLWKRMDKLEAEKRSLQIRLDQPVSDPASPRDISNGDTASNLSSHIQTLRSEVVKLRNQLAFSQNESKEKMQRFALEEKHIREENVRLHRKLQQEVERREALCRHLSESESSLEMEEERQFNEALNARSRSVSSPGGSRPLSPYALHNPARPALHFNSQQARRASERFVKPAVPGAGVSLPPRVPLESASAPAPPAPPSPSMQPASPMDTSSRE-