Convert page id to string explicitly

This commit no longer exists in the repository. It may have been part of a branch which was deleted.This commit has been deleted in the repository: it is no longer reachable from any branch, tag, or ref.

Description

Convert page id to string explicitly

Otherwise, when a line such as

{"dump_talk_page_title": "U-597", "talk_page_id": 4294172, "timestamp": "20140129153907", "project": "marca de projeto", "wp10": "1"}

is processed, we get an error like this:

Traceback (most recent call last):
  File "./utility", line 4, in <module>
    articlequality.main()
  File "/home/he7d3r/projects/articlequality/articlequality/articlequality.py", line 54, in main
    module.main(sys.argv[2:])
  File "/home/he7d3r/projects/articlequality/articlequality/utilities/fetch_text.py", line 48, in main
    run(labelings, output, session, verbose)
  File "/home/he7d3r/projects/articlequality/articlequality/utilities/fetch_text.py", line 53, in run
    for labeling in fetch_text(session, labelings, verbose=verbose):
  File "/home/he7d3r/projects/articlequality/articlequality/utilities/fetch_text.py", line 89, in fetch_text
    labeling['talk_page_id'] + " " + labeling['timestamp'])
TypeError: unsupported operand type(s) for +: 'int' and 'str'
Makefile:553: recipe for target 'datasets/ptwiki.labeled_revisions.with_text.9k_2020.json' failed
make: *** [datasets/ptwiki.labeled_revisions.with_text.9k_2020.json] Error 1

This is a curious example, where the talk page was created BEFORE the content page:

Details

Provenance
He7d3rAuthored on May 20 2020, 7:57 PM
ChangeId
None

Commit No Longer Exists

This commit no longer exists in the repository.