Monarch geneset OGS2.0

DPOGS206836
Transcript: DPOGS206836-TA (2520 bp)
Protein: DPOGS206836-PA (839 aa)
Genomic position: DPSCF300001 - 3303519-3310175
RNAseq coverage: 55x (Rank: top 69%)

Annotation
Heliconius: HMEL013258 | 0.0 | 88.74%
Bombyx: BGIBMGA012783-TA | 0.0 | 69.04%
Drosophila: CG6761-PA | 9e-126 | 40.80%
EBI UniRef50: UniRef50_UPI00022464A5 | 1e-174 | 41.19% | UPI00022464A5 related cluster n=1 Tax=unknown RepID=UPI00022464A5
NCBI RefSeq: XP_395466.2 | 3e-159 | 46.71% | PREDICTED: similar to CG6761-PA isoform 1 [Apis mellifera]
NCBI nr blastp: gi|345481705 | 4e-174 | 41.19% | PREDICTED: uncharacterized protein KIAA1841-like [Nasonia vitripennis]
NCBI nr blastx: gi|345481705 | 3e-170 | 42.65% | PREDICTED: uncharacterized protein KIAA1841-like [Nasonia vitripennis]
Group:
KEGG pathway:
InterPro domain: [253-550] IPR021777 | 5.7e-92 | Protein of unknown function DUF3342
InterPro domain: [223-357] IPR011333 | 7.2e-07 | BTB/POZ fold
Orthology group: MCL15219 | Single-copy universal gene
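
The two InterPro hits above occupy overlapping stretches of the protein. The following is a minimal sketch of how their lengths and shared region could be computed, assuming the bracketed ranges are 1-based, inclusive residue positions (the record does not state this). The helper names `span_length` and `span_overlap` are illustrative only.

```python
# Residue ranges copied from the InterPro rows of this record.
# Assumption: coordinates are 1-based and inclusive.
DOMAINS = {
    "IPR021777 (DUF3342)": (253, 550),
    "IPR011333 (BTB/POZ fold)": (223, 357),
}


def span_length(span):
    """Number of residues covered by an inclusive (start, end) range."""
    start, end = span
    return end - start + 1


def span_overlap(a, b):
    """Number of residues shared by two inclusive ranges (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


if __name__ == "__main__":
    for name, span in DOMAINS.items():
        print(name, span_length(span), "aa")
    duf, btb = DOMAINS.values()
    print("shared residues:", span_overlap(duf, btb))
```

Under that coordinate assumption, the DUF3342 hit spans 298 residues, the BTB/POZ hit spans 135, and the two share residues 253-357.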

Nucleotide sequence:

>DPOGS206836-TA
ATGTCTAGCCCAAACCCAGAAACAGACTCATCGTCAATGTTTATGCAAAAAGGCGATGGTAGAAATACTAGAAGTCCCACTAAATGTGAATCATCTTCCAGAAAAAATTCCCTTGTAAAGGCAAGTATACCGGCTGAGGTGATTACTGTGAAAGATTTTTTTGACTTCATGAAAACAGCTTATCAAGTTTATGAGCATCTAGAAGGTGATGAGGACGAAGGGTCAAAAATTAACTGGGAAGAATTGACTAAGCTGACGGTAATAAATCATGTAATAGGACAACAGGAAAGAGATCTTTTGCAACAAACTGCGTTCACTGGTCAGAAAACTATAACAAGCCAAAGTGATACAGTGATTGAAAGGAAGAATAGCGATACAGAACGAACCGATAAGAGATATAGTTGTAGTGAAATTGAAGGGCTAGGGCGTAGGCGGTCCTTGCCTCCAGAAGCTCGTGTAGAGATGCTTGGCGTTTGGACAGAAGCTAATTTAAACTGCAAATTGAATGACGTTATAAATGAAGGAATTTTGGACTCTATACTTCCCTACCTTGTCGGCTATAAAAAACCTTCAAAAACAACGAGTGTTTACGCACCGATTATAAAAAAGTCACCATCAAGTTCTTCTACTGAAATCAAAAAGCCGGCTAGTTTTGGTGGTTTCGTCAATGAAAAAGAATCCTGGGACCGTTTGGGTAGACGTAAATCTTCTGTAACAGCAGCACAAGATCGCGTAAATCAGAAACAAGAAGGCGACGTTGAAATTCATGTTTGCGATGAAGTAAAGGGTTTGAAGAAGGATTTTAGATGTCCCCAAAAACTACTTATATCTAAAATGGGCTACTTTGCTGACGTGACGGCGGGTCAGCGTTTGGAAGACATGGATATATCTGTACACTGCGACATCCAGATATTTGATTGGTTGATGAGATGGGTGAAACGAGACACGATTCTCGTAGCAGACTGGCCGCTACTGGATCCACAGAATGTTGTGCCCATCCTTGTATCAGCTTCATTTTTGCAGATGGAGCCCCTACTCCACGATTGTTTGATTTACTGTCATGCACACATGAATGATATAGTCAAGACTTCAACCAACCTGGCCTGCTTGAGCGATGCTCTTTTAACTAGGTTAGCAGCTATGTACACAAATGCAGAGCTCGAAGCCATCAGAGATCGTAAAGATAAGATACAATCTCGTCTGTACTGCAAGATGATCATGTCACTGGCAGAACCGGTACCGGAAACATTAAGGGGACATTACGCAACACTTGCCACGCTATTTAAGTGCAGCAAATGCAACAAGCTGCTCGGTCGACACATCGCTGGACAGGTCCCCTGCCAGCCTTCTTCCATGCGCATCGACAGACGAGGGAACGTCATCTCGGAACACACCAAGGATCCGTCTTGGAGTCTCAATGAATATATTCGCTGGTTGTATGGTGAGCTACGATCATGGCGTCGCGTCTACTGGCGGCTGTGGGCCGACTGTCACTTCCTACATTGTCAGCTCTGCGACTCCTATTTCCCTGCTTATCAGATGGAGTGGTGTTCTCACCATGAGCAGAGCCCCCAAATGTTTGCTGTACAGGGTGCTCCGCTGGCGGCTGGGCGCTACGCTTGCTGTGGGGAGAGAGCCTACCGATTCGAGACACTTACGAGGAACACCGGTTGTCAGTTTAGAGAACATGTCCCAGAGGAGCGGCTGCCCGCAGATGCAGCAGTTATGGAGATCTACACTCAGTTCCGGGACATCATCGCCATGAGGCCTCCACAGCTCATGTTCCCTGAACGGCTTACAAGACTGGTACCTAGAGAAGGCTCGGTGTCAGGGGCTCGACTGCAGTGCTCGGAGGTGTACTGGTGGCAGGGCGTGCAGTTGGTTCCACCTCGGGCTCCGCTGGGGCTGCTCGGACGACTGTACACCAGCGAACAACAGTTCAACCGTGAGATCTGGGGCGCTCTGGTCCGCCCCGGCTCGCCCCCCGTCGGTGCGGTTCG
TAGTTCCGCGCAGTCACTCGCTGGGAACGCAACAGGTGCTTGCCCTTCACCTGACCAGCGAGAAGTTAAAAAACCCAAGGAAGATAGTCCGAGAAGTAAGCAAGGCGTTGGCAGCGGTTCACTAGAGGCGTGCGAATCTAGCGACTCAAGTGAACAGAGCGATGACAGCGATCGTAGCGACGAAGACGCGCCGCGACGGTCCAGGACAAGGAACAGGACCGCGCCTCCCGCCAGAAACACTGTTCCTCGCCGTGGACGTGCACCGGGTCGCGTGTGGGTATCCACCCAGAGCGCTCGTAGTAACCAAGATGGACAGAGGAGGTTCGAGGAGCGCGCCGCTTCACATATGAGGGCCGCTTTAGTAAGGAGACACCATCACACGTCTCGAACAAGACCCCACCCAGGCGGTGTATACGCACGCTTGGAGGCGGAATGGCGTGAACGTCAAGGCTGCGTCACTACAACACGGACGCGACAGGCTGCCGCCTCACGCGCACGGCCCGCTAATAAATACAAATAA

Protein sequence:

>DPOGS206836-PA
MSSPNPETDSSSMFMQKGDGRNTRSPTKCESSSRKNSLVKASIPAEVITVKDFFDFMKTAYQVYEHLEGDEDEGSKINWEELTKLTVINHVIGQQERDLLQQTAFTGQKTITSQSDTVIERKNSDTERTDKRYSCSEIEGLGRRRSLPPEARVEMLGVWTEANLNCKLNDVINEGILDSILPYLVGYKKPSKTTSVYAPIIKKSPSSSSTEIKKPASFGGFVNEKESWDRLGRRKSSVTAAQDRVNQKQEGDVEIHVCDEVKGLKKDFRCPQKLLISKMGYFADVTAGQRLEDMDISVHCDIQIFDWLMRWVKRDTILVADWPLLDPQNVVPILVSASFLQMEPLLHDCLIYCHAHMNDIVKTSTNLACLSDALLTRLAAMYTNAELEAIRDRKDKIQSRLYCKMIMSLAEPVPETLRGHYATLATLFKCSKCNKLLGRHIAGQVPCQPSSMRIDRRGNVISEHTKDPSWSLNEYIRWLYGELRSWRRVYWRLWADCHFLHCQLCDSYFPAYQMEWCSHHEQSPQMFAVQGAPLAAGRYACCGERAYRFETLTRNTGCQFREHVPEERLPADAAVMEIYTQFRDIIAMRPPQLMFPERLTRLVPREGSVSGARLQCSEVYWWQGVQLVPPRAPLGLLGRLYTSEQQFNREIWGALVRPGSPPVGAVRSSAQSLAGNATGACPSPDQREVKKPKEDSPRSKQGVGSGSLEACESSDSSEQSDDSDRSDEDAPRRSRTRNRTAPPARNTVPRRGRAPGRVWVSTQSARSNQDGQRRFEERAASHMRAALVRRHHHTSRTRPHPGGVYARLEAEWRERQGCVTTTRTRQAAASRARPANKYK-
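
The lengths reported in this record (2520 bp transcript, 839 aa protein) can be cross-checked against each other. The sketch below assumes the transcript is coding sequence only, with one codon per residue plus a single stop codon, and that a FASTA record may wrap its sequence over several lines, as the nucleotide record above does. The function names and the toy record are hypothetical.

```python
def fasta_sequence(record: str) -> str:
    """Join the sequence lines of one FASTA record, skipping the '>' header."""
    lines = [line.strip() for line in record.splitlines()]
    return "".join(line for line in lines if line and not line.startswith(">"))


def lengths_agree(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript holds one codon per residue plus one stop codon.

    Assumption: the transcript contains no untranslated regions.
    """
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # Toy record (hypothetical), wrapped over two lines.
    toy = ">toy-TA\nATGTCT\nAGCTAA\n"
    seq = fasta_sequence(toy)
    print(seq, len(seq))

    # Lengths as reported in this record.
    print(lengths_agree(2520, 839))
```

With the reported values, 3 × (839 + 1) = 2520, so the two lengths are consistent under the stated assumption.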