Monarch geneset OGS2.0

DPOGS208429
TranscriptDPOGS208429-TA1653 bp
ProteinDPOGS208429-PA550 aa
Genomic positionDPSCF300095 - 249619-272243
RNAseq coverage158x (Rank: top 52%)
Annotation
HeliconiusHMEL0072126e-8866.14% 
BombyxBGIBMGA009035-TA1e-6252.78% 
Drosophilaspz3-PA6e-9551.45% 
EBI UniRef50UniRef50_E1ZV812e-9458.97%Putative uncharacterized protein n=2 Tax=Camponotus floridanus RepID=E1ZV81_CAMFO
NCBI RefSeqXP_001121955.12e-9557.37%PREDICTED: similar to Spz3 CG7104-PA [Apis mellifera]
NCBI nr blastpgi|3838511074e-9757.49%PREDICTED: uncharacterized protein LOC100875106 [Megachile rotundata]
NCBI nr blastxgi|3838511071e-10453.83%PREDICTED: uncharacterized protein LOC100875106 [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL12844 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208429-TA
ATGGCTTTGGCAAACTTTACTGTGGCTACTAGCGGTGTTGCCACTCCGGCACTCCCAGCAACCACTGCAGCGCCTTTTCGAGTAAATCTGGATGATTTGTTATACTCCCCCGAAGAACAACAGGCTGGATATGCCCCTCATTATAGTGGCAACGAACCATTCACACCCCCTAATTTACAACAGCACAGAAAGTCTAAAGTTCGAAAACGTGCTGGTACTGCATACCTTCCGCCACCAAATCAATTCACGCATTATCCACGACAAGATAACATACCTCAGAATCCTTTCAATAGTCCACTACACTCACAAATCTATCAACAAAATACAAGGCTTGGCATTCAAATAGAAAACCAACGAGTTCATGATATAAATTCTCAGTTCGATAATAATGGACAGTACGTACATGATCCAACTGGCGACTATGATTACGAACAAAAAAGAACTGATAAAAATATGCCAAATCTTGCCACATACAATCCGGGATACGTGGATCTACCTTACGCACCAGCTTCAACGCAGAGTCCTTTATCTAGTCGAAGGTTTGAGAATATACCAAACCAAAATACAAATACTAATGTTCCGAGTATGACAACGTCACCTCAAAGTTCTCGAAAGCCAACTGATGCTGGATCTAACACAAGTGGAAGCAGTACCAATACACAGGTACCTCCAGACCGTCCTCGAGGTTTTACTAAAGTGGAAACAGGAGGCACGGGCGGCAAAACTCAATTGCACGCCATTTTGGATTACGACGACGATTATTATGAAGATGTTCCTGGATCAGATGTAGGACAAGCAGTGACCCCGTTACAAGGGCCGATATTATTGCGTAACGGGACCGTGCCTGTTGTTCCCCTAACCTCCTATCCCACTGTCAACAATGGATCGTTTTACCAAATACCTATCCTGTGGACCGCCCTGTCTCTAGCCTTGGGGTACGAACTCCAAGGTCAAGTGATTAGAGGAGTGCCGTGCGTCAAAAGAAACTTTCAACTCTATTGTCCAACCGCTGGCAACACGTATCCTCTAGATAAGATCGAAACATTTATAGATGAAAACAAAGCTTTAATAAAGCGAATGTATGGTTCATTCACGGTTCCCGGCGGAACTAGAGTGAGAAGAGCGCCAGGGGTTCCCGATATGCACTCCGGAGACTCCTACTTCCGACATATAAGACAGACGAAACCGAAACTACCAGATACACAATCAGTTAATAATACCGGAAGGATAGATGCTTGTGAAAGTAAAACTGAAATAATGACCCCGTACTGGGCTTTGAACTCTGCAAGGAAAGTTCGAGCAATTGTGAACACGATGCACTTCGAACAGGCCATACACCAGGAGGTTTGCAGCAAAAAGTCAACATCAAGATGTTCGGCGGACTGTGGTTGCGAACAGAAGTACAAGTGGCATCGACTCCTCGCCTACGATCCCAATGACGACTGCGCCGGTATATTCATGGACTGGTTCTTATTTCCCTCCTGTTGTGTTTGTAGCAAAAAGTCAACATCAAGATGTTCGGCGGACTGTGGTTGCGAACAGAAGTACAAGTGGCATCGACTCCTCGCCTACGATCCCAATGACGACTGCGCCGGTATATTCATGGACTGGTTCTTATTTCCCTCCTGTTGTGTTTGCAGGTGTAAACCGTAA

Protein sequence:

>DPOGS208429-PA
MALANFTVATSGVATPALPATTAAPFRVNLDDLLYSPEEQQAGYAPHYSGNEPFTPPNLQQHRKSKVRKRAGTAYLPPPNQFTHYPRQDNIPQNPFNSPLHSQIYQQNTRLGIQIENQRVHDINSQFDNNGQYVHDPTGDYDYEQKRTDKNMPNLATYNPGYVDLPYAPASTQSPLSSRRFENIPNQNTNTNVPSMTTSPQSSRKPTDAGSNTSGSSTNTQVPPDRPRGFTKVETGGTGGKTQLHAILDYDDDYYEDVPGSDVGQAVTPLQGPILLRNGTVPVVPLTSYPTVNNGSFYQIPILWTALSLALGYELQGQVIRGVPCVKRNFQLYCPTAGNTYPLDKIETFIDENKALIKRMYGSFTVPGGTRVRRAPGVPDMHSGDSYFRHIRQTKPKLPDTQSVNNTGRIDACESKTEIMTPYWALNSARKVRAIVNTMHFEQAIHQEVCSKKSTSRCSADCGCEQKYKWHRLLAYDPNDDCAGIFMDWFLFPSCCVCSKKSTSRCSADCGCEQKYKWHRLLAYDPNDDCAGIFMDWFLFPSCCVCRCKP-