Abstract: On multi-core platforms, multi-programmed execution can achieve higher performance than single-threaded execution, which has made multi-programmed workloads a common scenario. Scheduling multiple tasks effectively so that on-chip resources do not become a bottleneck is therefore an important issue. This paper proposes a memory load balancing mechanism that addresses the underutilization of on-chip memory resources. By characterizing memory load and monitoring performance at the kernel level, we implement an integrated computing and memory load co-balancing system that schedules tasks at the kernel level. Experimental results show that, compared with the default load-balancing scheduler, the memory load balancing scheduling system improves multi-core system throughput by 8.2% with negligible overhead.