Monarch geneset OGS2.0

DPOGS209945
TranscriptDPOGS209945-TA1626 bp
ProteinDPOGS209945-PA541 aa
Genomic positionDPSCF300148 - 361072-365612
RNAseq coverage277x (Rank: top 39%)
Annotation
HeliconiusHMEL0100004e-6955.52% 
BombyxBGIBMGA011262-TA8e-7842.75% 
Drosophila% 
EBI UniRef50UniRef50_Q5RH015e-0931.25%Condensin-2 complex subunit H2 n=2 Tax=Danio rerio RepID=CNDH2_DANRE
NCBI RefSeqXP_002399696.19e-1034.62%condensin-2 complex subunit H2, putative [Ixodes scapularis]
NCBI nr blastpgi|1489222581e-0829.32%Si:dkey-202b22.2 [Danio rerio]
NCBI nr blastxgi|3224953033e-1825.54%proteophosphoglycan ppg1 [Leishmania mexicana MHOM/GT/2001/U1103]
Group
KEGG pathway 
InterPro domain[58-99] IPR0093783.4e-06Non-SMC condensin II complex, subunit H2-like
Orthology groupMCL25827 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209945-TA
ATGAATTCCCAAAGGCTGGAGGAGATAGTGGCGGAGCTGATGAAGCCAATCAGCGACGTCCGCCGCAGTTTCGACACGGATCTCAGCGCGTTACTGGAGGAGTACCTGACGGAGGCGGGGCAACAGGCTCTAGAAGCCGAGGCCAGTGGCAATCATTGCTACAACACACCTAATTTTGCAGAGGTGGCTCTCTTGCTACAGCAGTCGGCCAGTATCTATGGTCGCAAAGTGGACTGTCTCTACTCTCATGTGCTATGTGTCAGTGATGCGCTTCACAATAACACTCAAGAAACTAACGTGTTGGCCGACGAGACGCACACTCCCAGCGGCGGCCGGAGGAAACGGAAGGCGTCCGTCAGCGGCGACTTCGACTACATCGCGCTGGAGACCTGCGGCGCCGCGCGTAGGGACGCCGGGCCCTCGCGACCCCCGCCCACACTGCCCAGGATGTACGTGGAGCTCGAGCCCAGAGTCGTTTCCTCACACGACCATCAGCTCACAGATTACCTCGGGGAGCCCATAGGACTGTTGGCGGACTTCAACGTCTCGTGGAGGCTACGGAACGGGTTGCTGGTAGATGAGCTGGCTAGCACCGAGGGCGGCGCACCGGGACTGCGACCTGCGCCGCTGCTGGAGCTGCGTGCGGCCATGGAAGCCGCCGCGCCTCCCTCGCCCCCGCCCGCGACCTCCTCCCCCCCTCCTGCGCCCTCCTCACCCCGGCCCGAGCAACCCTCTTCACCGCCGCCGTCCGCTCCTGACTCGTGTTCGACGCCTCTGCCCCAAAGGAAGGAGGTTAGGAGGAAGCGACGGAGCGAAGTCAAACTTGAGGATATTGTGGACGGACAAGTCAAACTGCTTATCAGCAAAGAGTTGCGAGGTAAGTTGCGGCGTGTTGAGGAGTTCAGCTTGCCGGTGGACTGGGTCGCCAGGGTCGTGGAGGGCCGCGCCTCCGCCGTGAGGGAGCTTCGGCGCGGACTGCGGGGACACCGCGCCGAGACAGAATTCCGCGGCTTCGACGTGACGAACTCTATGGACGTTGGAGGGTTCCTCGGCTGGAGCGGGCCGGAGGCGGCGGCGGCGGCGGCGGCGCTCAGCGCGGCCGCCGCCGCCAGGCTCGACGACAGCGACGACGACGGCTTCTTCGAACAGAGCTCGCTCGGCGACTCCGACACCTCGCGCGCCGACGACACCGGCGCCACCGCGCTCTCGGTACCTACGCGCCCTATCCCGCGACCTCCCCGGCGACCCCTTGTAACAGCAGGCGTGTGCTTCCAGTCGTTGCCGGGCAGCGGCTGCGAGTGGTGGAGCTGGCGCGAGGCGGTGGTGTCGCGCAGCACGGCGGCGGCGGCGCGCGGCGCCGATGTGAAGGAGGGTGCGCGGGCCGTGCTCGCGGCGGCCGGCGCGCTGCCCTCGCCCGCCGCCTTCGACGCGGTGCTGGCGGCCGCCGCCGAGCAGACGCACGACGTGTCCCGCCTGTTCCTGTCCGCGCTGTTCCTGGCCAACGCGGGCTCCATCGAGATCGTGCCGGGCGCCCCCCTGTCGCTGAACTCGTTCGGCATCCGCGTGTTGTCTCGCGACGAGCGTCTCTACCTGTCGGTGGTGGAGGACCGTCCGCCGCCGCGGTAG

Protein sequence:

>DPOGS209945-PA
MNSQRLEEIVAELMKPISDVRRSFDTDLSALLEEYLTEAGQQALEAEASGNHCYNTPNFAEVALLLQQSASIYGRKVDCLYSHVLCVSDALHNNTQETNVLADETHTPSGGRRKRKASVSGDFDYIALETCGAARRDAGPSRPPPTLPRMYVELEPRVVSSHDHQLTDYLGEPIGLLADFNVSWRLRNGLLVDELASTEGGAPGLRPAPLLELRAAMEAAAPPSPPPATSSPPPAPSSPRPEQPSSPPPSAPDSCSTPLPQRKEVRRKRRSEVKLEDIVDGQVKLLISKELRGKLRRVEEFSLPVDWVARVVEGRASAVRELRRGLRGHRAETEFRGFDVTNSMDVGGFLGWSGPEAAAAAAALSAAAAARLDDSDDDGFFEQSSLGDSDTSRADDTGATALSVPTRPIPRPPRRPLVTAGVCFQSLPGSGCEWWSWREAVVSRSTAAAARGADVKEGARAVLAAAGALPSPAAFDAVLAAAAEQTHDVSRLFLSALFLANAGSIEIVPGAPLSLNSFGIRVLSRDERLYLSVVEDRPPPR-