The IEEE BigData 2026 Cup Challenge on Explainable Suicide Risk Assessment on Social Media is part of the annual Big Data Cup series held under the auspices of the IEEE International Conference on Big Data (https://bigdataieee.org/BigData2026/). This competition introduces a dual-objective design that advances both suicide risk detection and clinical interpretability. Participants are required to: (1) identify the suicide risk level of a social media post (40%), and extract from the original text phrases or short clauses that serve as evidence for the risk level judgment (30%); (2) identify suicide factors in the post (30%). The top 8 teams will be invited to submit papers describing their solutions. Accepted papers will be presented at the IEEE BigData 2026 conference (Phoenix, Arizona, USA, December 14–17, 2026).
The topic of this year's competition is explainable suicide risk assessment from Reddit posts, with a focus on both suicide risk level detection and clinical evidence extraction. The dataset contains posts collected from mental health communities on Reddit (r/SuicideWatch), annotated by trained annotators following established clinical risk assessment protocols grounded in the Columbia-Suicide Severity Rating Scale (C-SSRS).
During the competition, participants upload a formatted predictions file to a live leaderboard for real-time feedback. Near the end of the competition, participants upload a solution report (in the IEEE conference template) and their solution code via a provided Google Drive link. The organizers then perform a final evaluation on a separate dataset and assess report quality. Invitations are based on the final performance of the solution and on report quality (innovation of the approach, experiments, and writing). The final ranking determines the top 8 teams, who will be invited to extend their work for submission and publication in the IEEE BigData 2026 proceedings (subject to review by the Organizing Committee and PC members). Authors of accepted papers will be invited to present at the conference.
This challenge consists of two subtasks. Participants may compete in both subtasks. The final composite score is weighted 70% on Subtask 1 and 30% on Subtask 2.
Subtask 1: Suicide Risk Detection (Weight: 70%)
Given a Reddit post, classify the author's suicide risk level into one of four categories: Indicator, Ideation, Behavior, or Attempt.
Score = Subtask 1 (0.4 × suicide risk detection + 0.3 × evidence support for detection) + Subtask 2 (0.3 × suicide factor identification)
Submit a .csv file with the following structure. The file name must be YourTeamName.csv.
| Field | Type | Description |
|---|---|---|
| row_id | string | The post identifier from the dataset |
| risk_level | string | One of: Indicator, Ideation, Behavior, Attempt |
| evidence | string | Semicolon-separated evidence text spans that support the predicted suicide risk level (e.g., want to kill myself; feel hopeless). Each span should be copied verbatim from the original post. |
| factors | list | List of factor categories |
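As a rough illustration of the structure above, the following Python sketch builds one row and writes it as CSV. The column names, the four risk levels, and the semicolon-separated evidence format come from the table; everything else is an assumption made for the example: the helper names `make_row` and `write_submission`, the sample post, the placeholder factor label `ExampleFactor`, and in particular the choice to store the `factors` list as a JSON array in a single cell, since the serialization of that column is not specified here.

```python
import csv
import io
import json

RISK_LEVELS = {"Indicator", "Ideation", "Behavior", "Attempt"}
FIELDS = ["row_id", "risk_level", "evidence", "factors"]


def make_row(row_id, post_text, risk_level, evidence_spans, factors):
    """Build one submission row, checking it against the field table."""
    if risk_level not in RISK_LEVELS:
        raise ValueError(f"unknown risk_level: {risk_level!r}")
    for span in evidence_spans:
        # Evidence spans must be copied verbatim from the original post.
        if span not in post_text:
            raise ValueError(f"span not found verbatim in post: {span!r}")
    return {
        "row_id": row_id,
        "risk_level": risk_level,
        "evidence": "; ".join(evidence_spans),
        # Assumption: the list is stored as a JSON array in one CSV cell.
        "factors": json.dumps(factors),
    }


def write_submission(rows, fileobj):
    """Write rows with a header line in the column order of the table."""
    writer = csv.DictWriter(fileobj, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)


# Hypothetical post and factor label, for illustration only.
post = "I feel hopeless and I cannot sleep anymore."
row = make_row("post_001", post, "Indicator", ["feel hopeless"], ["ExampleFactor"])

buffer = io.StringIO()  # in practice: open("YourTeamName.csv", "w", newline="")
write_submission([row], buffer)
print(buffer.getvalue())
```

Checking each span against the post text before writing catches paraphrased evidence early. Note that a span which itself contains a semicolon would be ambiguous under the semicolon-separated format, so such cases may need clarification from the organizers.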
Please submit the prediction file created by your team. Multiple submissions are permitted during the evaluation phase (up to 3 per day). The file format should be .csv, and the file name must be: YourTeamName.csv. Scores of uploaded prediction results will be updated on the leaderboard in real time. For a detailed explanation of the submission format, please refer to the 'Task Description' section above.
Based on the submitted final solution, teams will be evaluated according to the following selection criteria:
The top 8 teams will be invited to submit a paper describing their solution (up to 10 pages in the IEEE two-column conference format, including references) for the IEEE BigData 2026 proceedings and to present at the IEEE BigData 2026 conference (Phoenix, Arizona, USA, December 14–17, 2026). Certificates of achievement will be issued to all teams.
Paper submission system: https://bigdataieee.org/BigData2026/ (link to be updated when the chairs open the proceedings submission portal).
Review policy: single-blind review.
The leaderboard will be updated in real time during the evaluation phase (starting June 1, 2026). Results below show the composite score (70% Subtask 1 + 30% Subtask 2).
| Rank | Team Name | Subtask 1 | Subtask 2 | Composite Score |
|---|---|---|---|---|
| 1 | HelloWorld | 0.4346 | 0.2292 | 0.3730 |
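The composite column can be reproduced from the two subtask columns with the stated 70/30 weighting. The short Python sketch below uses a hypothetical helper name, `composite_score`, and assumes each subtask score is already on a common 0 to 1 scale, as the leaderboard values suggest.

```python
def composite_score(subtask1, subtask2, w1=0.7, w2=0.3):
    """Weighted composite: 70% Subtask 1 + 30% Subtask 2."""
    return w1 * subtask1 + w2 * subtask2


# Values from the HelloWorld row of the leaderboard.
score = composite_score(0.4346, 0.2292)
print(f"{score:.4f}")  # 0.3730, matching the Composite Score column
```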
Once you have read and accepted the Data Usage Agreement below, please send your team's information to the registration email address in the following format. We will respond with the dataset download link.
We accept the Competition Data Usage Agreement
For registration and general inquiries, contact Alex at hialex.li@connect.polyu.hk