Monarch geneset OGS2.0

DPOGS206913
Transcript         DPOGS206913-TA   1149 bp
Protein            DPOGS206913-PA   382 aa
Genomic position   DPSCF300001 - 1473335-1478780
RNAseq coverage    441x (Rank: top 28%)
Annotation
Heliconius        HMEL015628               1e-146   84.88%
Bombyx            BGIBMGA012870-TA         4e-122   79.70%
Drosophila        CG9947-PA                4e-125   56.88%
EBI UniRef50      UniRef50_UPI0002061528   2e-104   58.04%   UPI0002061528 related cluster n=1 Tax=unknown RepID=UPI0002061528
NCBI RefSeq       XP_002055015.1           2e-130   60.78%   GJ19142 [Drosophila virilis]
NCBI nr blastp    gi|195392746             4e-129   60.78%   GJ19142 [Drosophila virilis]
NCBI nr blastx    gi|156537938             2e-131   61.40%   PREDICTED: cell cycle control protein 50A-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology      GO:0016020   3.9e-180   membrane
KEGG pathway       (none listed)
InterPro domain    [4-338]   IPR005045   3.9e-180   Protein of unknown function DUF284, transmembrane eukaryotic
Orthology group    MCL14148   Single-copy universal gene

Nucleotide sequence:

>DPOGS206913-TA
ATGGCTACATCAAGTGACACGTCTGATCAAAATGTGAAATCCAGACGCCCTGCAGAATCTGCATTCAAACAACAACGATTACCTGCTTGGCAGCCCATTTTAACGGCGGGCACCGTGCTTCCGACCTTCTTCGTGATCGGTATTGCTTTCATTCCCGTTGGAATTGGTCTACTTTATTTTTCCGATGAGGTGAAAGAGCATGTTATTGACTACACCTACTGTTTGAAGGAAGATGAGAATATAACTTGTGCTGAATTTATAAGACAAAACAACATGGATCCATGTGCTTGTCAAATACCATTCAATTTGACTGAAGATTTCAAGGGTGATGCGTACTTTTACTATGGTCTCAGCAATTACTACCAAAATCATCGTAGATATGTTAAGTCAAGGGATGACAGCCAGCTGCTTGGACGGCTCTCGTCTCCACCATCCTCAGATTGTGAACCATTTGCATATGCAGAAGAAGATGGAAAAATGAAACCAATAGCACCATGCGGAGCCATAGCTAACTCTTTATTTAATGACACCTTAACTGTACACTCTGTGGATTTAAATGTGGATGTCCCTGTGTTGAAAACTGGTATTGCCTGGACATCTGACAAAGATATCAAATTTAGGAACCCATCAGGTGACCTCAAGACTGCCTTTGCGAACTACACCAAACCAATTAATTGGCGTAAACCAGTGTGGATGTTGGATCCTAATAACTCAGAAAACAACGGTTTCCAGAATGAAGATCTCATAGTATGGATGCGTACAGCAGCACTGCCAACATTCCGCAAACTTTATCGCATTGTTGACCAGCAGGTTGGCTTCATTGCCGGTCTGGTTAAGGGGCCTTATGTGCTGAAAGTGGATTACAATTATCCAGTAACAGATTTTCAAGGTACCAAGACATTCATAATATCAACGACTTCTCTCCTGGGTGGAAAGAACCCATTTTTGGGAGTGGCATATGTGGTGGTGGGGACACTTTGTTTGTTACTGGGCATTGTACTACTGGTCATTCATGTTAGATGCTCTAAGAGATATCCCCCTCCCCTACCCCACTCCAGCTATATCGAACAAGTTCCTATTCAACAAAATATAAATATCACTTCCACAACAGAGATGATCAATGTTAACCCACGAACACCATATTCTTAA

Protein sequence:

>DPOGS206913-PA
MATSSDTSDQNVKSRRPAESAFKQQRLPAWQPILTAGTVLPTFFVIGIAFIPVGIGLLYFSDEVKEHVIDYTYCLKEDENITCAEFIRQNNMDPCACQIPFNLTEDFKGDAYFYYGLSNYYQNHRRYVKSRDDSQLLGRLSSPPSSDCEPFAYAEEDGKMKPIAPCGAIANSLFNDTLTVHSVDLNVDVPVLKTGIAWTSDKDIKFRNPSGDLKTAFANYTKPINWRKPVWMLDPNNSENNGFQNEDLIVWMRTAALPTFRKLYRIVDQQVGFIAGLVKGPYVLKVDYNYPVTDFQGTKTFIISTTSLLGGKNPFLGVAYVVVGTLCLLLGIVLLVIHVRCSKRYPPPLPHSSYIEQVPIQQNINITSTTEMINVNPRTPYS-
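
The transcript and protein lengths in this record are consistent with a plain codon-by-codon translation: 1149 bp is 383 codons, that is, 382 amino acids plus one stop codon. The sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1) and assuming that the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the variable names are illustrative and are not part of the record; the two short fragments are copied from the start and end of the DPOGS206913-TA sequence above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of DPOGS206913-TA, copied from the record.
first_fragment = "ATGGCTACATCAAGTGACACGTCTGATCAA"
last_fragment = "CCATATTCTTAA"

start_peptide = translate(first_fragment)  # compare with the start of DPOGS206913-PA
end_peptide = translate(last_fragment)     # ends in '*', the stop codon

# Length check: 1149 bp -> 383 codons -> 382 residues plus one stop.
transcript_bp = 1149
expected_residues = transcript_bp // 3 - 1
```

Passing the full nucleotide sequence to `translate` and replacing the final `*` with `-` should reproduce the protein sequence as printed, provided the assumptions above hold.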