开发者

dynamically calculate number of processes to be spawned

开发者 https://www.devze.com 2023-04-11 08:37 出处:网络
I have a list of about 15 years in the year_queue, I need to spawn one process for each year. But depending on which server I am running the code, the number of processors vary. How do I dynamically v

I have a list of about 15 years in the year_queue, I need to spawn one process for each year. But depending on which server I am running the code, the number of processors vary. How do I dynamically vary the variable num_processes depending on the number of processers in the server?

If I set num_processes > number of processers, would it automatically spawn accordingly? When I test this - it creates 15 processes & splits the CPU power between them. I am looking for a way to first create 'n' number of processes, where n = number of p开发者_JAVA技巧rocessers in the server, and then as each of those processes finish, the next is spawned.

for i in range(num_processes):
    worker = ForEachPerson(year_queue, result_queue, i, dict_of_files)
    print "worker spawned for " + str(i)
    worker.start()

results = []
while len(results) < len(years):
    result = result_queue.get()
    results.append(result)

Anyone had the same issue?


while year_queue.empty() != True:
    for i in range(num_processes):
      worker = ForEachPerson(year_queue, result_queue, i, dict_of_files)
      print "worker spawned for " + str(i)
      worker.start()

    # collect results off the queue
    print "results being collected"
    results = []
    while len(results) < len(num_processes):
      result = result_queue.get()
      results.append(result)


Use a multiprocessing Pool. The class does all the tedious work of selecting the right number of processes and running them for you. It also doesn't spawn a new process for each task, but reuses processes once they're done.

def process_year(year):
    ...
    return result

pool = multiprocessing.Pool()
results = pool.map(process_year, year_queue)


from multiprocessing import Process, Queue, cpu_count
from Queue import Empty

class ForEachPerson(Process):
    def __init__(self, year_queue, result_queue, i, dict_of_files):
        self.year_queue=year_queue
        self.result_queue=result_queue
        self.i=i
        self.dict_of_files=dict_of_files
        super(ForEachPerson, self).__init__()

    def run(self):
        while True:
            try:
                year=self.year_queue.get()

                ''' Do something '''

                self.result_queue.put(year)
            except Empty:
                self.result_queue.close()
                return

if __name__ == '__main__':
    year_queue=Queue()
    result_queue=Queue()
    dict_of_files={}

    start_year=1996
    num_years=15

    for year in range(start_year, start_year + num_years):
        year_queue.put(year)

    workers=[]
    for i in range(cpu_count()):
        worker = ForEachPerson(year_queue, result_queue, i, dict_of_files)
        print 'worker spawned for', str(i)
        worker.start()
        workers.append(worker)

    results=[]
    while len(results) < num_years:
        try:
            year=result_queue.get()
            results.append(year)
            print 'Result:', year
        except Empty:
            pass

    for worker in workers:
        worker.terminate()
0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号