Monarch geneset OGS2.0

DPOGS209178
TranscriptDPOGS209178-TA1209 bp
ProteinDPOGS209178-PA402 aa
Genomic positionDPSCF300061 + 161071-162483
RNAseq coverage941x (Rank: top 14%)
Annotation
HeliconiusHMEL0097502e-16872.70% 
BombyxBGIBMGA011533-TA6e-14260.70% 
Drosophila% 
EBI UniRef50UniRef50_E2C4M05e-1823.42%Protein FAM175A n=4 Tax=Formicidae RepID=E2C4M0_HARSA
NCBI RefSeqXP_395910.32e-2029.73%PREDICTED: similar to coiled-coil domain containing 98 [Apis mellifera]
NCBI nr blastpgi|3227843822e-2027.31%hypothetical protein SINV_05455 [Solenopsis invicta]
NCBI nr blastxgi|3112628735e-2125.14%PREDICTED: BRCA1-A complex subunit Abraxas-like [Sus scrofa]
Group
KEGG pathway 
InterPro domain[9-27] IPR0232387.1e-15FAM175 family
Orthology groupMCL18904 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209178-TA
ATGGCTTACACGGAGAAAGTGTCTCTCAACGGTACCGCGCTCTCTTTTTTACTATACGAATGCATGAATTCTGTTAATAGTCAGGAGGGCTTTCTGATTGGAGACGTAACTTCACAAATCACCAACCATATATCTGACTCCCAAAATGACAACGCTAGATTGGACACTCAAATTGTGATTAGGACCGTTCTGCCTCTACCTTCAGTGTCGCTGTTTTATTTACCTACCGGTTCAATTAAAGAGGACGTACTATCGGATCTAATATCGAGTGCCGCAGCCGAAATTGTTGGTTGGTATAAATATAGAAGAAATAGTAGCATTAAACCAACGTTCAGAGACAAGTTAATATCAAGAGGCTTACAGAAGTATTTCGAGAAATATCACGGCAAGAAAACTTTTGTTTCATGCAATCTTTCAAGTAAATCTTCATCTGCATGCTCCACACATACAGTTATATACAGATTTGGAAAAATAAATTGCTTCGATATGTACGAATATGTTGAGGATGTGACCGCAAACTTGGGTGAAAAGTTATCAGGTTATAAAAAATCAAAAAGAGTTTCACCTCATTGCATATTTAACAAAATTGTTAAGGAGAGTAATGTAGACGTCAATAACACAAGGGAAGCTATATTGTCCATTCAGGAAGCGGTCGATGTGGCATTGATTAGGGAAGCCAAGGTAGCAGCTAGGAATGAATGCAAAGTCAAAGAACTTGAGGCCGAAATTAAGCAAATGAGTGCCATATTAGCTGAGAAACATGAAGCCGATTTGAGTTCGGCCTATAACAAAGTACTGGAAAAGAAATATATCAATAGGGATGTTGAAATGGCGCAGGCCTGTATTGACGCGATTAGAACACCTGCCAGTGTGGACATCCTCAGCATTCCAAATGTTATGATTATATCCAACATAACCGAATCAACCGTCGAAACCCCGATTCAGCACACCACTGATGTTAACGGGAGAGGCCAAAGTCCGAGTGTAGATTGCTCAAATCTCGTACTACCTTCCACATCGGGACAATCACCTTCATTGAACTACGCCGCAGCAGTCAAGAAACCCTCAGACGGAGGATCCTGCAGCAAAACGAAAAATCTCAATGAAGATCTCATCACCTTTGATGTAGAGGACAAGAGCCTGAGGACACAAAAACAGGTCATTGTCAATGACAACATCAGTTTAGGCGGAAGCTCGCCCGAGTATTGA

Protein sequence:

>DPOGS209178-PA
MAYTEKVSLNGTALSFLLYECMNSVNSQEGFLIGDVTSQITNHISDSQNDNARLDTQIVIRTVLPLPSVSLFYLPTGSIKEDVLSDLISSAAAEIVGWYKYRRNSSIKPTFRDKLISRGLQKYFEKYHGKKTFVSCNLSSKSSSACSTHTVIYRFGKINCFDMYEYVEDVTANLGEKLSGYKKSKRVSPHCIFNKIVKESNVDVNNTREAILSIQEAVDVALIREAKVAARNECKVKELEAEIKQMSAILAEKHEADLSSAYNKVLEKKYINRDVEMAQACIDAIRTPASVDILSIPNVMIISNITESTVETPIQHTTDVNGRGQSPSVDCSNLVLPSTSGQSPSLNYAAAVKKPSDGGSCSKTKNLNEDLITFDVEDKSLRTQKQVIVNDNISLGGSSPEY-