Monarch geneset OGS2.0

DPOGS203516
Transcript: DPOGS203516-TA, 1233 bp
Protein: DPOGS203516-PA, 410 aa
Genomic position: DPSCF300055 - 468032-469763
RNAseq coverage: 110x (Rank: top 59%)
Annotation
Heliconius: HMEL004108, E-value 0.0, identity 74.21%
Bombyx: BGIBMGA004346-TA, E-value 1e-62, identity 70.86%
Drosophila: CG11170-PB, E-value 2e-12, identity 22.37%
EBI UniRef50: UniRef50_F4WS47, E-value 2e-56, identity 36.80%, UPF0672 protein C3orf58-like protein n=3 Tax=Myrmicinae RepID=F4WS47_ACREC
NCBI RefSeq: XP_001606587.1, E-value 4e-47, identity 31.86%, PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|322779198, E-value 1e-57, identity 35.22%, hypothetical protein SINV_11522 [Solenopsis invicta]
NCBI nr blastx: gi|322779198, E-value 3e-60, identity 35.38%, hypothetical protein SINV_11522 [Solenopsis invicta]
Group
KEGG pathway 
InterPro domain: [52-147] IPR020519, E-value 8e-07, Uncharacterised protein family UPF0672
Orthology group: MCL16439 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203516-TA
ATGAGCGACAAAATCTACAAGATGATATTGCTTAAACGGCGTCTGTGTAAACGATTAGAGATTGTTGTTATTGCTATGCTCGCAATATCTTTTTACATATCAGTTTTATTATTCGGCGGTTTTAAGGGTCCAGAGATAATTCATGTAACCGATTTACATCGATGTCCAGCTTGCTATGGGGTCACAGTGTGCCCAGAACTTTATTCAAATCAAATCATATTGGATTCTTCTCACTGGTCGAATATGTTTAACGCAAAGAACATTTATCATGGTTACACAAAGTCCAGTAGGAGAGTTATATTAAAAAAGCTAGCTCACGATTGGGAGCTGAATGAATTTGATTCAAAACTTTGCTCCGAATGGAGATTAAATAATGATTGCAAACCTGTACAATTGTTAAATATGACAAATATAGATGATAAAGTTCTAGACATAGTTGAATATAATATAACATGGCCTGATACTGAGCCAAGGAAAGGACTGGTGCTGTGTCCATATGCATACAGCATATATGACCTTCTTCAACCAGTGTTTAGCAATAATAAAAATAACTATAAGTCTGAAATGTTGAACATATGGACCATGCTGAGTATAAACCCAGAACCTATCATACTACAGGTTTTACCAAGATCAAAGGGCTGGCCAGTTCCAGCTTTCGGTGGAGTCTGTGGTCGTCTGGAAGTGGTAGCTTATGAAGGCGAACCTTTGTCTTCACTACTACATGTCGCGTGGCATCGGAAACTTAATTACGCTCAGAAGATACTTGAAGCTGCCATGGATTTTACTTTTAAGCATGAAAGATTTCGCTTCTACCTCATGGATTGGTCATTAGATAATATCGTAGTCAACGAACGAGATGAGATCACATTTGTCGACTTGGAAGACGTGATTGTTCTGGATAAGCATATTTCACCGAAACCGGAGCTGGTAGATTGGTACAAGCGTTACAATAGAGAATCTATAGGGCCTGGGTTTTCATTTTCTATAGAGAACATGTGTAAACACCACTTGAGCGATCACAATCTGTGGGCAGCATGTTATATATTGATTGGTGATGAAAGTCCACTGCTGTATCCTATACCAAAGGAGGTGAACGCATCCCGACCCTACTTCAATAGACTATTGTCGGCTTGCCTGAACGGGGATGACAGATTCAAAACAATACGGAAACTCCAACATGTTATTGAAGAAATGCTGAACGATGAAAAGATTCTCCGATCTGGAGTCAGTTGA

Protein sequence:

>DPOGS203516-PA
MSDKIYKMILLKRRLCKRLEIVVIAMLAISFYISVLLFGGFKGPEIIHVTDLHRCPACYGVTVCPELYSNQIILDSSHWSNMFNAKNIYHGYTKSSRRVILKKLAHDWELNEFDSKLCSEWRLNNDCKPVQLLNMTNIDDKVLDIVEYNITWPDTEPRKGLVLCPYAYSIYDLLQPVFSNNKNNYKSEMLNIWTMLSINPEPIILQVLPRSKGWPVPAFGGVCGRLEVVAYEGEPLSSLLHVAWHRKLNYAQKILEAAMDFTFKHERFRFYLMDWSLDNIVVNERDEITFVDLEDVIVLDKHISPKPELVDWYKRYNRESIGPGFSFSIENMCKHHLSDHNLWAACYILIGDESPLLYPIPKEVNASRPYFNRLLSACLNGDDRFKTIRKLQHVIEEMLNDEKILRSGVS-
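
The transcript and protein lengths in this record are consistent with each other: 1233 bp corresponds to 410 codons plus one stop codon (3 x 411 = 1233). As a minimal sketch, assuming the standard genetic code applies and that the record writes the stop codon as "-", the translation can be checked as follows. The names `CODON_TABLE`, `translate`, and `cds_length_matches` are hypothetical helpers introduced for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Assumption: stop codons are written as `stop_symbol`, mirroring the
    trailing '-' in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def cds_length_matches(cds_bp: int, protein_aa: int) -> bool:
    """True if the CDS length equals 3 * (protein length + one stop codon)."""
    return cds_bp == 3 * (protein_aa + 1)


# First eight and last three codons of DPOGS203516-TA, copied from the record.
print(translate("ATGAGCGACAAAATCTACAAGATG"))
print(translate("GTCAGTTGA"))
print(cds_length_matches(1233, 410))
```

Passing the full nucleotide sequence above to `translate` should reproduce the protein sequence, provided the assumptions about the genetic code and the stop symbol hold.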