lightrft.datasets.imagegen_cot_reward¶

class lightrft.datasets.imagegen_cot_reward.ImageGenCoTRewardGRMHandler[source]¶

Data handler for ImageGen-CoT-Reward-5K dataset. For Text-to-Image generation task.

get_media_info(item: Dict[str, Any]) → Dict[str, Dict[str, str]][source]¶

Extract path info for the two images.

Example:

info = handler.get_media_info(item)

load_data(path: str) → List[Dict[str, Any]][source]¶

Loads data from json file.

Example:

handler = ImageGenCoTRewardHandler()
data = handler.load_data("path/to/ImageGen-CoT-Reward.json")

parse_item(item: Dict[str, Any], media_content: Dict[str, Any], config: Dict[str, Any] | None) → Tuple[List[Dict], Dict][source]¶

Parse a single ImageGen-CoT-Reward item into message pairs.

Parameters:

item (Dict[str, Any]) – Raw data item from ImageGen-CoT-Reward dataset.
media_content (Dict[str, Any]) – Loaded image content (PIL images/bytes)
config (Dict[str, Any]) – Configuration for max_pixels

Returns:

A tuple of (messages, metadata)

Return type:

Tuple[List[Dict], Dict]

Example:

messages, other = handler.parse_item(item, media_content, config)