sourcecred

Commit Graph

Author	SHA1	Message	Date
William Chargin	7f81337d74	Store GitHub data gzipped at rest (#751 ) Summary: We store the relational view in `view.json.gz` instead of `view.json`, taking advantage of the isomorphic `pako` library for gzip encoding and decoding. Sample space savings (note that post bodies are included; i.e., #747 has not been applied): SAVE OLD (B) NEW (B) REPO 89.7% 25326 2617 sourcecred/example-github 82.9% 3257576 555948 sourcecred/sourcecred 85.2% 11287621 1665884 ipfs/js-ipfs 88.0% 20953425 2520358 gitcoinco/web 84.4% 38196825 5951459 ipfs/go-ipfs 84.9% 205770642 31101452 tensorflow/tensorflow <details> <summary>Script to generate space savings output</summary> ```shell savings() { printf '% 7s % 11s % 11s %s\n' 'SAVE' 'OLD (B)' 'NEW (B)' 'REPO' for repo; do file="${SOURCECRED_DIRECTORY}/data/${repo}/github/view.json.gz" if ! [ -f "${file}" ]; then printf >&2 'warn: no such file %s\n' "${file}" continue fi script="$(sed -e 's/^ //' <<EOF repo = '${repo}' pre_size = $(<"${file}" gzip -dc \| wc -c) post_size = $(<"${file}" wc -c) percentage = '%0.1f%%' % (100 (1 - post_size / pre_size)) p = '% 7s % 11d % 11d %s' % (percentage, pre_size, post_size, repo) print(p) EOF )" python3 -c "${script}" done } ``` </details> Closes #750. Test Plan: Comparing the raw old version with the decompressed new version shows that they are identical: ``` $ <~/tmp/sourcecred/data/sourcecred/example-github/github/view.json \ > shasum -a 256 - 63853b9d3f918274aafacf5198787e18185a61b9c95faf640a1e61f5d11fa19f - $ <~/tmp/sourcecred/data/sourcecred/example-github/github/view.json.gz \ > gzip -dc \| shasum -a 256 63853b9d3f918274aafacf5198787e18185a61b9c95faf640a1e61f5d11fa19f - ``` Additionally, `yarn test --full` passes, and `yarn start` still loads data and runs PageRank properly. wchargin-branch: gzip-relational-view	2018-09-01 10:42:30 -07:00
William Chargin	d4202b2304	Add a configurable feedback URL to prototype (#715 ) Summary: We can now set, at build time, a URL to be displayed at the top of the prototype, encouraging users to provide feedback. If the URL is not provided, it defaults to the appropriate topic on the SourceCred Discourse instance. The result looks like this: ![Screenshot of the feedback URL in the prototype][screenshot] [screenshot]: https://user-images.githubusercontent.com/4317806/44814824-a238b380-ab92-11e8-88c8-dfbae27ca496.png Test Plan: Unit tests added to `yarn sharness-full` and `yarn unit`. You can run `yarn start` to see the message with the default URL, or `SOURCECRED_FEEDBACK_URL=http://example.com/ yarn start` to specify a custom URL. wchargin-branch: feedback-url	2018-08-29 15:06:12 -07:00
William Chargin	d839fcae95	build_static_site.sh: create target if nonexistent (#695 ) Summary: The current version of the build script has the safe but annoying property that the target directory must be an existing, empty directory. It seems reasonable and convenient to allow the build script to create the directory with `mkdir -p`. It still fails if the directory is not empty or is a file. Test Plan: Unit tests updated; run `yarn sharness-full`. wchargin-branch: build-mkdir-p	2018-08-21 15:35:13 -07:00
William Chargin	4bbbfeebdb	Add a `.gitignore` for Sharness (#673 ) Summary: This is copied from the Sharness repository’s `test/.gitignore`, and is also the same as Git’s `t/.gitignore` minus a Git-specific exclusion. Suggested by @decentralion. Test Plan: Run `yarn sharness` and interrupt it (SIGINT) while `make` is running. Note that `sharness/` now contains a `trash directory.*`, which Git ignores. wchargin-branch: sharness-gitignore	2018-08-15 16:31:00 -07:00
Dandelion Mané	233bec4f5e	Re-disable the Git plugin (#645 ) Thanks to #642, it should now be safe to disable the Git plugin, reaping the benefits described in #628, without causing the cred explorer to crash (#631). Test plan: - `yarn travis --full` passes - The full cred explorer works: - Running PageRank does not crash the explorer - Expanding a pull request does not crash the explorer - (After clearing state) the weight config doesn't show Git weights - The filter doesn't show Git nodes	2018-08-10 19:44:02 -07:00
William Chargin	3eb2b6eec6	Add a favicon (#637 ) Summary: In addition to the obvious benefit of having a favicon, this gets rid of a 404 Not Found error on our home page, tremendously boosting our hacker cred. Test Plan: The favicon is displayed in both `yarn start` and the static site (as a result of the build script). The added build test fails before this change. wchargin-branch: add-favicon	2018-08-10 13:15:49 -07:00
Dandelion Mané	885ff90f62	Revert "Disable the Git plugin (#628 )" (#633 ) This reverts commit `8c70f03122`. Context: This introduced a serious bug (#631), so we're reverting it to get the codebase back in a working state. Meanwhile, I'll develop a principled solution. Test plan: I rebuilt the backend, re-loaded a graph, and loaded it in the frontend. PageRank, the cred explorer, and the weight config all work. Opening a pull request does not trigger a crash.	2018-08-10 11:40:20 -07:00
Dandelion Mané	8c70f03122	Disable the Git plugin (#628 ) See #627 for context. Removing the Git plugin results in an enormous performance improvement. In my testing on `metamask/metamask-extension`: before this change, load took 23s, and PageRank took >9 mins and then crashed. After this change, load+PageRank took 5s combined. Test plan: All unit tests pass; loading new data from the CLI works; and I poked around the UI to make sure there were no spurious references to the Git plugin. Note: This does not break backcompat, there's no need to regenerate any already-loaded data.	2018-08-09 23:08:01 -07:00
William Chargin	c02a81ff43	Expose a cache directory to plugins at load time (#616 ) Summary: The `node ./bin/sourcecred.js load` command invokes plugin code by providing an output directory into which the plugin may store data. As of this patch, it also provides a cache directory that the plugin may use to store data that will not be available at runtime. For instance, the Git plugin might choose to clone the repository herein, or the GitHub plugin may choose to store partial GraphQL query results to deal with interruptions. The contract is that the cache directory may be removed at any time and that the plugin should continue to operate normally. Test Plan: The build script has been updated and tested. Reverting the change to the build script causes the newly added test to fail. (Each plugin has a cache directory, though the cache directories are empty for now.) wchargin-branch: create-plugin-cache	2018-08-07 13:10:15 -07:00
William Chargin	d7cb4c65fa	Create script to build static site (#592 ) Summary: Currently, we create the static site and deploy it all at once in `scripts/deploy.sh`. This commit creates a new script that only builds the static site. This has the advantage that it is easier/less scary to change that script (because it can be tested without worrying about deploying to a local test target), and that we can write automated tests for it. Test Plan: Run `yarn sharness`; note that it completes very quickly. Then, in a shell with your GitHub token exported, run `yarn sharness-full`. Expect all tests to pass. For a sanity check, you can run: ```shell outdir="$(mktemp -d --suffix .sourcecred-site)" ./scripts/build_static_site.sh --target "${outdir}" \ --cname sourcecred.io \ --repo sourcecred/example-git \ --repo sourcecred/example-github \ ; (cd "${outdir}" && python -m SimpleHTTPServer) ``` and ensure that <http://localhost:8000/> is as expected. One test case that is not covered is the following: _if_ the actual app somehow tries to emit a `CNAME` file at root, _and_ our script’s logic to catch this is broken, then we will not catch this failure. I’ve tested the logic manually by adding `>"${cname_file}"` after definition of that variable, but I don’t see a good way to test it automatically, without adding flags like `--but-actually-emit-cname-too` to the build. The compound probability of this happening is sufficiently low that this doesn’t bother me. wchargin-branch: build-static-site-script	2018-08-06 13:05:40 -07:00
William Chargin	baa0cbff1b	Add `sharness` for shell-based testing (#597 ) Summary: We will shortly want to perform testing of shell scripts; it makes the most sense to do so via the shell. We could roll our own testing framework, but it makes more sense to use an existing one. By choosing Sharness, we’re in good company: `go-ipfs` and `go-multihash` use it as well, and it’s derived from Git’s testing library. I like it a lot. For now, we need a dummy test file; our test runner will fail if there are no tests to run. As soon as we have a real test, we can remove this. This commit was generated by following the “per-project installation” instructions at https://github.com/chriscool/sharness, and by additionally including that repository’s `COPYING` file as `SHARNESS_LICENSE`, with a header prepended. I considered instead adding Sharness as a submodule, which is supported and has clear advantages (e.g., you can update the thing), but opted to avoid the complexity of submodules for now. Test Plan: Create the following tests in the `sharness` directory: ```shell $ cat sharness/good.t #!/bin/sh test_description='demo of passing tests' . ./sharness.sh test_expect_success "look at me go" true test_expect_success EXPENSIVE "this may take a while" 'sleep 2' test_done # vim: ft=sh $ cat sharness/bad.t #!/bin/sh test_description='demo of failing tests' . ./sharness.sh test_expect_success "I don't feel so good" false test_done # vim: ft=sh ``` Note that `yarn sharness` and `yarn test` fail appropriately. Note that `yarn sharness-full` fails appropriately after taking two extra seconds, and `yarn test --full` runs the latter. Each failure message should print the name of the failing test case, not just the suite name, and should indicate that the passing tests passed. Then, remove `sharness/bad.t`, and note that the above commands all pass, with the `--full` variants still taking longer. Finally, remove `sharness/good.t`, and note that the above commands all pass (and all pass quickly). wchargin-branch: add-sharness	2018-08-06 12:56:25 -07:00

11 Commits