如何访问输出阶段的 Mapper/Reducer 计数器?

时间：2023-05-04

本文介绍了如何访问输出阶段的 Mapper/Reducer 计数器?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着跟版网的小编来一起学习吧！

问题描述

限时送ChatGPT账号..

我在 Mapper 课程中创建了一些计数器:

I have some counters I created at my Mapper class:

(使用 appengine-mapreduce Java 库 v.0.5 编写的示例)

(example written using the appengine-mapreduce Java library v.0.5)

@Override
public void map(Entity entity) {
    getContext().incrementCounter("analyzed");
    if (isSpecial(entity)){
        getContext().incrementCounter("special");
    }
}

(方法 isSpecial 只是根据实体的状态返回 true 或 false，与问题无关)

(The method isSpecial just returns true or false depending on the state of the entity, not relevant to the question)

我想在处理完所有内容后访问这些计数器，在 Output 类的 finish 方法中:

I want to access those counters when I finish processing the whole stuff, at the finish method of the Output class:

@Override
public Summary finish(Collection<? extends OutputWriter<Entity>> writers) {
    //get the counters and save/return the summary
    int analyzed = 0; //getCounter("analyzed");
    int special = 0; //getCounter("special");
    Summary summary = new Summary(analyzed, special);
    save(summary);
    return summary;
}

...但是 getCounter 方法只能从 MapperContext 类，只能通过 Mappers/Reducers getContext() 方法访问.

... but the method getCounter is only available from the MapperContext class, which is accessible only from Mappers/Reducers getContext() method.

如何在输出阶段访问我的计数器?

How can I access my counters at the Output stage?

旁注:我无法将计数器值发送到我的输出类，因为整个 Map/Reduce 是将一组实体转换为另一组(换句话说:计数器不是 Map/减少).计数器仅用于控制 - 我在这里计算它们而不是创建另一个进程只是为了进行计数是有道理的.

Side note: I can't send the counters values to my outputted class because the whole Map/Reduce is about transforming a set of Entities to another set (in other words: the counters are not the main purpose of the Map/Reduce). The counters are just for control - it makes sense I compute them here instead of creating another process just to make the counts.

谢谢.

如何访问输出阶段的 Mapper/Reducer 计数器?

问题描述

推荐答案

相关文章