sourcecred

Commit Graph

Author	SHA1	Message	Date
Dandelion Mané	1d32141f76	Fix `sourcecred load` for real repos (#1164 ) As discussed in #1163, #1162 caused `sourcecred load` to start failing for real repos (e.g. sourcecred/research). This commit merges a somewhat hacky fix. Test plan: `yarn test` passes, and `sourcecred load sourcecred/research` works.	2019-05-31 20:16:58 +03:00
Tyler Mace	1fbf8cd587	Quicker failure and description when invalid token supplied (#1161 ) Fixes #1156 When users export a GitHub API token that has insufficient privleges or has been revoked, we have been using a catch all error with retry to handle it. This change adds a new error type for bad credentials and does not retry. Test plan: There are no unit tests that cover this, however, you can test the change by supplying a revoked token and attempting to load a GitHub repo.	2019-05-30 22:18:30 +03:00
Dandelion Mané	ad2470e5c6	Aggregate timestamp information on sourcecred load (#1162 ) This modifies `sourcecred load` so that it saves timestamp information for all of the loaded plugins in a single aggregated map. This is quite convenient, as it saves consumers of timestamp information from needing to worry about the (rather hacky) implementation whereby the data is fed from each adapter. Instead, consumers can just load the timestamp map. This will also make it much easier to use timestamp info in the research codebase. Test plan: The timestampMap module has testing around generating the map from the adapter and nodes, writing it, and reading it. I haven't added any testing to the `load` CLI command. I think it would be redundant as the updated snapshot test reveals that the map is getting serialized properly. Tests pass, and I have inspected the snapshot	2019-05-30 17:15:15 +03:00
Dandelion Mané	4dc97fcc57	PagerankGraph reuses existing distribution (#1160 ) This commit modifies `PagerankGraph.runPagerank` so that rather than always starting from a uniform distribution, it starts with the PagerankGraph's existing score distribution. The PagerankGraph initializes with a uniform score over nodes, so it has the exact same behavior on the first time that runPagerank is called. On subsequent calls, PageRank will likely converge a lot faster, because it's starting from converged scores. (It should still be a lot faster in cases where e.g. the user has tweaked the weights.) In certain degerate cases, this could change the resultant scores. Specifically, if there are isolated nodes in the graph and alpha=0, then the isolated nodes' final scores depends on the initial score. In general, I think this won't be an issue as we expect alpha > 0 in normal usage. Test plan: I added a unit test to verify this property, by checking that running PageRank with maxIterations==0 on an already-converged graph results in a still-converged graph. Also, existing tests pass. I think we can now close #1020.	2019-05-29 20:29:28 +03:00
Dandelion Mané	14eee06799	Add a universal snapshot updater (#1159 ) As SourceCred has evolved, we've grown more and more snapshot tests that are not included in Jest. The GitHub plugin has two ad-hoc snapshot tests, the Git plugin has one, and the sharness test suites have one. This makes it difficult to keep track of where to update snapshots when core changes are made. To fix this, I've added a script, `scripts/update_snapshots.sh`, which updates snapshot tests across the project. Test plan: I removed existing snapshots across the codebase, ran the snapshot tester, and they correctly regenerated.	2019-05-28 18:59:50 +03:00
Dandelion Mané	e01247a642	Expose `createdAt` in `AnalysisAdapter` (#1157 ) * Refactor Loader from AnalysisAdapter At present, the only data the AnalysisAdapter provides is the Graph, so the AnalysisAdapter has a `load` method which directly returns the graph. I'm planning to add a `createdAt` getter to the adapter as well, which also will depend on loading the data. To make this change convenient, I'm starting by refactoring an AdapterLoader out, which manages loading data from disk, so that once we have an AnalysisAdapter, it already has all relevant data loaded. Then, it will be much easier to add a `createdAt` method. Test plan: Tests updated, flow passes. * Add `createdAt` to the analysis adapter A big step forward for time-varying cred. This will make `createdAt` timestamps available for PageRank analysis. Test plan: Added new unit tests. Inspect the snapshots. Run `yarn test`.	2019-05-28 18:03:44 +03:00
Dandelion Mané	da5bce255e	Add authorDate tracking to Git commits (#1153 ) Modifies the Git plugin so that we now track commit author dates. Similar to in #1152, they are encoded in MsSinceEpoch. Test plan: `yarn test --full` passes, except for the pre-existing failure discussed in #1151. Thanks to @s-ben for a conversation which motivated these changes.	2019-05-21 06:56:57 +03:00
Dandelion Mané	d51c0b6715	Add createdAt timestamp tracking to GitHub (#1152 ) Updates github schema to include createdAt timestamps, and then updates the RelationalView to provide those timestamps as MsSinceEpoch. I added createdAt timestamps to Repos, Issues, Pulls, Reviews, and Comments, as these correspond to GitHub graph nodes where I think time-based filtering is relevant. I didn't add them to Users, Reactions, or Commits. Reactions, because they correspond to edges not nodes. (We could consider doing the time filtering on edges too, but I'd rather keep it simple for now.) Commits, because they're owned by a different plugin. Users, because... in a certain sense the user identity is timeless, the time factoring is mostly so we can evaluate how users' cred varies over time. Anyway, it will be easy to add more fields later if we need them. Test plan: - Inspect snapshot changes - Ran `yarn test --full` - Its only failure is pre-existing, per #1151 Thanks to @s-ben for some motivation and discussion about this change.	2019-05-21 06:16:16 +03:00
Dandelion Mané	a831e05e5f	Add a WeightsFileManager (#1150 ) This adds a WeightsFileManager component that allows the user to save or load weights in the cred explorer. Clicking the download icon downloads the weights, clicking the upload icon uploads them. I also did a slight refactor to the FileUploader so that it no longer always provides the file upload icon, instead the instantiator passes children which act as the upload clickable. Seemed more consistent. Test plan: No tests added, but I manually tested that upload and download both work.	2019-05-21 04:41:00 +03:00
Dandelion Mané	fb89559e44	Add a FileUploader component, with inspection test (#1149 ) * Add FileUploader with inspection test TODO: get it working * Add a FileUploader component, with inspection test This adds a FileUploader component, which allows the user to upload JSON files. Rather than using automated testing, it has an inspection test. The inspection test may be run by navigating to: http://localhost:8080/test/FileUploader/ This commit also adds some basic utility functions for defining inspection tests to `routeData.js`. We should improve support for inspection tests in the future; see [#1148]. [#1148]: https://github.com/sourcecred/sourcecred/issues/1148 Test plan: Ran the included inspection test.	2019-05-20 17:12:57 +03:00
Dandelion Mané	f6a4042342	Add JSON serialization support for weights (#1147 ) Test plan: New unit tests added.	2019-05-20 14:12:12 +03:00
Dandelion Mané	03a8ab679b	Refactor the Weights module (#1146 ) This commit refactors the `analysis/weights` module so that there's a single top-level type called `Weights` which includes all of the user-specified weights for running PageRank. This includes both manually specified type weights, and per-node weights. In contrast to the old weights module which had weighted types (i.e. the type bundled with the weight), this only records the weight choice (keyed by address). Similarly, the map is empty unless the user has explicitly chosen a weight. This has a few advantages: - For planned work on serializing weights, it's convenient that there's a single unified object that bundles all the weights. - For planned work on serializing weights, it's convenient that we only record actual choices, not the default values (defaults are provided within the WeightsToEdgeEvaluator). This way, we don't need to manually filter out the default weights for serialization. (We want users who have not made any choice about the type weights to automatically get new type weightings we publish in sourcecred/sourcecred). - Overall, the WeightConfig data piping has become a lot simpler. While making these changes, I threw away the PluginWeightConfig component, instead inlining it in the top-level WeightConfig. I also moved responsibility for the WeightConfig out of the PagerankTable, and into the containing App; the WeightConfig is created and wired by the App, and then passed as a whole component into the PagerankTable via props. This means that PagerankTable does not need to get a bunch of extra props for piping the state changes into WeightConfig. I also threw away some frontend test code for the WeightConfig component. Over the course of the past few months, I've observed that the value of a lot of the frontend-wiring-testing is pretty low (i.e. they are not catching issues, because Flow catches them) and the cost of maintaining the testing is high. I'm now inclined to believe that we shouldn't be testing routine component wiring via explicit tests, because it's very easy to verify that the behavior is correct by interacting with the UI, and the tests are expensive to write and maintain. For "core logic" (e.g. behaviors of types and data structures, algorithms, etc) I continue to believe testing is very valuable. One heuristic I'll start applying is: "can I achieve high confidence that this code is correct, without tests, using little effort by manually inspecting the frontend?". If the answer is yes, I'm unlikely to write tests. Test plan: `yarn test` passes, and the UI continues to function, including changing manual and type level weights and observing cred changes.	2019-05-20 13:46:08 +03:00
Dandelion Mané	1ea3c1ec94	nit: remove fallbackDeclaration.js In the previous commit, I failed to remove this file. This is a cleanup. Test plan: `yarn test`	2019-05-19 15:50:03 +03:00
Dandelion Mané	467b781788	Remove the fallback adapter (#1145 ) Node and Edge types are increasingly important in SourceCred, as we use them to decide what weights to provide, what description logic to use, etc. In #631, we hit a bug around not finding a type matching a particular node, and in #640 I responded by creating a "FallbackAdapter", which matches any node, and a "Static/DynamicAdapterSet", which are abstractions for finding adapters in a way that guarantees the presence of the fallback adapter. I think this solution is actually pretty brittle. It only works if we are disciplined in only ever accessing node and edge types using a context that included the FallbackAdapter, and it requires us to centralize all of the "type-not-found" logic in one place (the fallback declaration) irrespective of what the locally appropriate solution is. Looking back at #631, the core issue was that we had a getter that promised to always give a matching type and adapter, rather than returning a possibly undefined adapter. We fixed this by attempting to ensure that there always would be a matching adapter. I think it would be better to have the methods honestly report that the adapter might be null/undefined, and let the client code handle it appropriately. This commit implements that change. It's motivated by me being frustrated at the fallback adapter while doing other refactoring. Test plan: Unit tests have been updated and `yarn test` passes. Also, I experimentally removed the Git plugin from the list of adapters for both the backend and frontend, and verified that the frontend UI and the pagerank and export-graph commands still worked.	2019-05-19 15:49:24 +03:00
Dandelion Mané	9d8d223653	Weights: Refactor around a new EdgeWeight type (#1144 ) This refactors the weights module (and downstream consumers) so that rather than tracking forwardWeight and backwardsWeight as separate fields, we have an `EdgeWeight` type with a forwards and backwards property. I feel this makes the code a little cleaner and wanted it a few times while doing other refactors in this module. Note that the graphToMarkovChain module has a distinct `EdgeWeight` type which serves a similar purpose but happens to have different field names. I've added a TODO to rename those fields for consistency. Test plan: Since this is slight data structure re-organization, no new tests are required; that `yarn test` passes is sufficient.	2019-05-19 14:17:46 +03:00
Dandelion Mané	d2559960bb	explorer: tweak weights on a per-node basis (#1143 ) This pull request adds a weight slider to every NodeRow in the explorer, enabling the user to manually set a weight for that node. The weights are multiplicative with the type level weights, so that they can be changed independently (e.g. you can have a comment that is weighted 2x higher than regular comments, but still have comments get a low weight in general). This pull coordinates a number of different changes across the codebase, all of which are tested: Adding support for manual weights in the weights and weightsToEdgeEvaluator modules. Modifying pagerankTable.TableRow so that it can show a slider in the second column. Adding piping for manual weights into the PagerankTable shared props, and into the explorer app Adding the slider to the NodeRow class that displays the current weight, and can trigger the upstream weight change Ensuring that the runPagerank call in the explorer actually uses the manual weights At present, there is no way to save these weights (they are ephemeral in the frontend) and so this is clearly a prototype/tech demo level feature rather than being ready for real usage. Correspondingly, CLI pagerank command always uses an empty set of manual weights. I plan to remedy this in a follow-on pull request. Test plan: Run the included unit tests (yarn test) and also spin up the UI, verify that it visually looks good in both Firefox and Chrome, and verify that changing the weights and then re-running PageRank actually causes the cred of the modified node to change. Review plan: In addition to carefully reading the code, ensure that all of the changes described a few paragraphs up are actually tested. Merge plan: Squash and merge. Thanks to @s-ben for proposing this feature in Discord, and to everyone discussing its implications in this Discourse thread.	2019-05-18 19:21:17 +03:00
Dandelion Mané	2999d24593	Fix a conflict between #1140 and #1141 (#1142 ) In #1140 I rename a field in PagerankGraph.ScoredNode, and in #1141 I added a new test which referenced the field. I merged them in quick succession, introducing a conflict. This PR fixes it. Test plan: `yarn test` passes.	2019-05-17 17:11:22 +03:00
Dandelion Mané	f046dd06a5	Add `setEdgeEvaluator` to PagerankGraph (#1141 ) This commit adds a `setEdgeEvaluator` method to `PagerankGraph`, and modifies the constructor to use that method. This will allow us to use PagerankGraph in the explorer UI, so that we can update edge weights without fully regenerating the graph. Test plan: I've added new unit tests that verify basic properties of how the edge weights are getting set and consumed.	2019-05-17 17:07:18 +03:00
Dandelion Mané	f8c659a413	Rename `ScoredNode.node -> ScoredNode.address` (#1140 ) It makes more sense that a ScoredNode has an address than that it has a sub-node (which is an address). Test plan: It's a simple field rename; `yarn test` suffices	2019-05-17 17:07:00 +03:00
Brian Litwin	0f038305a2	Add CLI command to clear sourcecred data directory (#1111 ) Resolves #1067 Adds the CLI commands: `sourcecred clear --all` -- removes the $SOURCECRED_DIRECTORY `sourcecred clear --cache` -- removes the cache directory `sourcecred clear --help` -- provides usage info `sourcecred clear` -- prompts the user to be more specific Test plan: The unit tests ensure that the command is properly wired into the sourcecred CLI, including help text integration. However, just to be safe, we can start by verifying that calling `sourcecred` without arguments lists the `clear` command as a valid option, and that calling `sourcecred help clear` prints help information. (Note: it's necessary to run `yarn backend` before testing these changes) The unit tests also ensure that the command removes the proper directories, so there isn't really a need to manually test it, although the reviewer may choose to do so to be safe. Although out of scope for unit tests on this function, we can also do integration tests, to make sure that running the clear command doesn't leave the sourcecred directory in an invalid state from the perspective of the `load` command. ```js $ yarn backend; $ node bin/sourcecred.js load sourcecred/example-github; $ node bin/sourcecred.js clear --cache; $ node bin/sourcecred.js load sourcecred/example-github; $ node bin/sourcecred.js clear --all; $ node bin/sourcecred.js load sourcecred/example-github; ``` The expected behavior of the above command block is that the load command never fails or throws an error. @decentralion and I discussed the scenario where `rimraf` errors. We decided that testing this scenario wasn't necessary, because `rimraf` doesn't error if a directory doesn't exist, and rimraf's maintainer suggests [monkey-patching the fs module] to get rimraf to error in testing scenarios. Thanks @decentralion for reviewing and pair-programming this with me. [monkey-patching the fs module]: https://github.com/isaacs/rimraf/issues/31#issuecomment-29534796	2019-05-13 12:59:58 +03:00
Dandelion Mané	bed476517c	Port skeleton of Odyssey frontend (#1132 ) This commit integrates an bare skeleton of the odyssey frontend that we implemented in the [odyssey-hackathon] repository. You can see the working frontend that we are trying to port over at [sourcecred.io/odyssey-hackathon/][scio]. The prototype in the other repository has some tooling choices which are incompatible/redundant with decisions in our codebase (sass vs aphrodite), and requires some tools not yet present here (svg-react-loader). This commit includes the build and integration work needed to port the prototype frontend into mainline SourceCred. The frontend scaffold isn't yet integrated with any "real" Odyssey data. One potential issue: right now, every page that is rendered from the SourceCred homepage is contained within a [homepage/Page], meaning that it has full SourceCred website styling, along with the SourceCred website header. The [application][scio] also has a header. Currently, I work around this by having the Odyssey UI cover up the base header (via absolute positioning), which works but is hacky. We can consider more principled solutions: - Finding a way to specify routes which aren't contained by [homepage/Page]; maybe by adding a new top-level route [here][route-alternative]. - Unify the headers for the Odyssey viewer and the page as a whole (sounds like inappropriate entanglement?) - Have a website header and also an application header (sounds ugly?) [homepage/Page]: `ee1d2fb996/src/homepage/Page.js` [route-alternative]: `ee1d2fb996/src/homepage/createRoutes.js (L17)` Test plan: Run `yarn start`, and then navigate to `localhost:8080/odyssey/`. observe that a working website is displayed, and that the cred logo next to the word "SourceCred" is loaded properly (i.e. svg-react-loader is integrated properly). Observe that there are no build/compile errors from either `yarn start` or `yarn build`. Also, observe that the UI looks passably nice, and that if the number of elements in the entity lists is larger than can be displayed, the sidebar pane scrolls independently. The UI was tested in both Chrome and Firefox. [odyssey-hackathon]: https://github.com/sourcecred/odyssey-hackathon [scio]: https://sourcecred.io/odyssey-hackathon/ Thanks to @jmnemo, as the implementation is based on [his work]. [his work]: https://github.com/jmnemo/hackathon-event/	2019-05-06 18:15:39 +03:00
Dandelion Mané	9231085185	Initial data model for the Odyssey plugin (#1134 ) This commit puts in a basic data model for the Odyssey plugin. It's built around the `OdysseyInstance` class, which is basically a Graph that keeps around descriptions for every node, and ensures that nodes/edges are typed in accordance with the Odyssey plugin declaration. In the future, I want to enable instances to declare their own node/edge types, in which case the instance will assume responsibility for tracking and serializing the types. To make the Odyssey plugin a 'proper plugin', I've also added a plugin declaration, as well as analysis and explorer adapters. I haven't decided exactly where data for Odyssey instances should be stored, so for now the plugin adapters always return an example instance which is based on our experience at the Odyssey hackathon. Test plan: The instance has unit tests for its logic. If you want to see what the plugin looks like right now when it's integrated, you can apply the following diff, and then load the prototype. It will contain Odyssey nodes (find them using the node type dropdown). Note that without a seed vector to move cred back to the values/artifacts, the cred distribution in the Odyssey subgraph is degenerate; the users are all sinks and have postiive cred scores, but all the other nodes converge to 0 cred. diff --git a/src/homepage/homepageExplorer.js b/src/homepage/homepageExplorer.js index cae4548..48987f1 100644 --- a/src/homepage/homepageExplorer.js +++ b/src/homepage/homepageExplorer.js @@ -6,6 +6,7 @@ import type {Assets} from "../webutil/assets"; import {StaticExplorerAdapterSet} from "../explorer/adapters/explorerAdapterSet"; import {StaticExplorerAdapter as GithubAdapter} from "../plugins/github/explorerAdapter"; import {StaticExplorerAdapter as GitAdapter} from "../plugins/git/explorerAdapter"; +import {StaticExplorerAdapter as OdysseyAdapter} from "../plugins/odyssey/explorerAdapter"; import {GithubGitGateway} from "../plugins/github/githubGitGateway"; import {AppPage} from "../explorer/App"; import type {RepoId} from "../core/repoId"; @@ -14,6 +15,7 @@ function homepageStaticAdapters(): StaticExplorerAdapterSet { return new StaticExplorerAdapterSet([ new GithubAdapter(), new GitAdapter(new GithubGitGateway()), + new OdysseyAdapter(), ]); }	2019-05-06 11:54:07 +03:00
Dandelion Mané	79017a477b	Add support for seed vectors to PagerankGraph (#1135 ) This commit modifies `PagerankGraph.runPagerank` so that the user can provide an alpha and seed vector. The seed vector is specified via a map of weights, which will be normalized into a probability distribution over all the nodes in the graph. In the event that the map is empty (or the total weight is otherwise 0), a uniform distribution is created. To effect this change, a helper function called `weightedDistribution` has been added (and thoroughly tested) in the `graphToMarkovChain` module. Then, that function is used in `pagerankGraph.runPagerank` (along with light testing). Currently, the default alpha is set to 0, to ensure consistency with the legacy pagerank implementation in `analysis/pagerank`. Once that has been replaced with `PagerankGraph`, we can consider changing the defualt alpha to non-zero (thus removing the need for synthetic self-loops). I took a different approach in the [odyssey-hackathon repo][commit]. The previous approach was a much more complicated (and fairly redundant) API, that allowed specifying "NO_SEED", "UNIFORM_SEED", "SELECTED_SEED", and "SPECIFIED_SEED". I'm much happier with this API and implementation. [commit]: `ed07861073` Test plan: Unit tests included; run `yarn test`.	2019-05-05 18:57:41 +03:00
Dandelion Mané	e7bc025379	Add support for PageRank Seed Vectors (#1128 ) Summary: the cred calculation is defined by a Markov Mixing process. By introducing the seed vector and teleportation parameter alpha, the Markov mixing process is augmented with a source of cred originating from the seed vector. The resulting algorithm is the generalized variation of PageRank, allowing computation of both canonical PageRank where the seed vector is the uniform distribution and personalized PageRank where the seed vector is an indicator distribution. It is still possible to get the simple markov chain solution by setting alpha = 0. Note that this changes the Markov process state update, but does not provide updates to the APIs. All existing behavior is unchanged because alpha is always set to 0. This is a port of https://github.com/sourcecred/odyssey-hackathon/pull/3, which was created during the Odyssey Hackathon. Test Plan: Existing tests have been extended to include passing alpha = 0 to reproduce exisiting test cases for the simple Markov Process. Additional test cases include - Verifying that resulting stationary distribution is unaffected by seed when alpha = 0 - Verifying that resulting stationary distribution is precisely equal to seed when alpha = 1 - Verifying that the resulting stationary distribution is linear in the seed vector - Verifying that the correct stationary distribution is computed for non-zero alpha - verify that the algorithm converges immediately when the initialDistribution is the stationary distribution - verify that the changing the initialDistribution does not change the stationary distribution Paired with @mzargham	2019-04-24 16:37:16 +03:00
Dandelion Mané	ee1d2fb996	Make PagerankGraph convergence options optional (#1131 ) Right now PagerankGraph requires that the user choose specific values for maxIterations and convergenceThreshold when running PageRank. I also rename `PagerankConvergenceOptions` to `PagerankOptions`. The motivation is that I want to add future arguments to the same options dict (e.g. alpha and the seed vector), so the rename is appropriate, and allowing the options to be unset (and thus inherit default values) will make the whole API much cleaner as I add more options. Test plan: Unit test added. `yarn test` passes.	2019-04-21 15:35:09 +03:00
Dandelion Mané	a8a3f4fc3a	refactor args to findStationaryDistribution (#1130 ) In [#1128: Add support for seed vectors][#1128], we significantly increase the number of arguments to markovChain.findStationaryDistribution. To clean up the invocations, I added a followon PR (#1129) which converts findStationaryDistribution to use a `PagerankParams` object instead. However, I think it will be cleaner to land the PagerankParams refactor before adding new features in #1128, so I'm making this PR as pre-cleanup. Test plan: This is a trivial refactor. `yarn test` passes. [#1128]: https://github.com/sourcecred/sourcecred/pull/1128	2019-04-21 14:00:30 +03:00
Dandelion Mané	6dd58a9c67	Use aphrodite for HomePage.js styling (#1127 ) This is a minor refactor so that we use Aphrodite for styling on HomePage.js. It's not super consequential, but I want to switch to using Aphrodite more consistently in the codebase, so why not start here. Test plan: `yarn test` reveals no errors. `yarn start` launches a correctly styled frontend. I also used `build_static_site.sh` and the resultant site is also correctly styled.	2019-04-19 17:10:38 +03:00
Dandelion Mané	7efcc13618	Automatically run pagerank on `sourcecred load` (#1115 ) This commit updates the `sourcecred load` command so that it also automatically runs PageRank on completion. The implementation is slightly hacky, in that it prints two sets of task status headers/footers to console, for reasons described in a comment in the source code. The user-visible effect of this hack can be seen below: ``` ❯ node bin/sourcecred.js load sourcecred/example-github Starting tasks GO load-git GO load-github DONE load-github DONE load-git Overview Final result: SUCCESS Starting tasks GO run-pagerank DONE run-pagerank Overview Final result: SUCCESS ``` It would be good to clean this up, but for now I think it's acceptable. Note that it is not safe to assume that a PagerankGraph always exists for repos that are included in the RepoIdRegistry. The repo gets added to the registry before the pagerank task runs. Consumers that are loading the `PagerankGraph` can just check that the file exists, though. Test plan: I've added unit tests that verify that the right tasks are generated. Most importantly, the snapshot of the results of `sourcecred load` now include a snapshotted pagerank graph. (The snapshot was updated via `UPDATE_SNAPSHOT=1 yarn test --full`.) Further progress on #967.	2019-04-11 21:21:29 +02:00
Dandelion Mané	fb6c9e1ba0	Make tests use SOURCECRED_GITHUB_TOKEN (#1124 ) Across SourceCred usage, we depend on the `SOURCECRED_GITHUB_TOKEN` environment variable being set. Confusingly, some tests expect `GITHUB_TOKEN` instead of `SOURCECRED_GITHUB_TOKEN`. This commit resolves that inconsistency, by having all tests that read from the environment use `SOURCECRED_GITHUB_TOKEN`. This was already available as a secret in our CI configuration, so no change is needed there. (After this merges, we may remove the GITHUB_TOKEN variable from the environment.) Test plan: `yarn test --full` passes without the environment variable set. Also, the following grep produces only innocuous hits. ``` git grep -P "(?<\!SOURCECRED_)GITHUB_TOKEN" ```	2019-04-11 21:18:24 +02:00
Dandelion Mané	320a69759e	refactor: `load` uses dependency injection (#1123 ) This commit refactors the `sourcecred load` CLI command so that it uses dependency injection, much like the testing setup #1110. This makes it feasible to test "surface logic" of how the CLI parses flags and transforms them into data separately from the "piping logic" of invoking the right API calls using that data. This is motivated by the fact that I have other pulls on the way that modify the `load` command (e.g. #1115) and testing them within the current framework is onerous. Test plan: This is a pure refactoring commit, which substantially re-writes the unit tests. The new unit tests pass (`yarn test --full` is happy). Note that `yarn test -full` also includes a sharness test that does an E2E usage of `sourcecred load` (see sharness/test_load_example_github.t), so we may be confident that the command still works as intended.	2019-04-11 18:59:08 +02:00
Seth Benton	13a90675a8	Fixes broken link README (#1122 ) This commit fixes three broken links (two in the README, one in the prototype app) that were still pointing to https://discuss.sourcecred.io/. Test plan: Verify that there are no other bad links to the old Discourse location, by running `git grep "discuss.sourcecred.io"`.	2019-03-26 16:01:39 -07:00
Dandelion Mané	012c4f3eb7	Add `sourcecred pagerank` for backend pagerank (#1114 ) This commit adds a new CLI command, `pagerank`, which runs PageRank on a given repository. At present, the command only ever uses the default weights, although I plan to make this configurable in the future. The command then saves the resultant pagerank graph in the SourceCred directory. On its own, this command is not yet very compelling, as it doesn't present any easily-consumed information (e.g. users' scores). However, it is the first step for building other commands which do just that. My intention is to make running this command the last step of `sourcecred load`, so that future commands may assume the existence of pagerank scores for any loaded repository. Test plan: The new command is thoroughly tested; see `cli/pagerank.test.js`. It also has nearly perfect code coverage (one line missing, the dependency-injected real function for loading graphs). Additionally, the following sequence of commands works: ``` $ yarn backend $ node bin/sourcecred.js load sourcecred/pm $ node bin/sourcecred.js pagerank sourcecred/pm $ cat $SOURCECRED_DIRECTORY/data/sourcecred/pm/pagerankGraph.json ``` Material progress on #967.	2019-03-25 18:05:58 -07:00
Dandelion Mané	669f34d009	Add `fetchGithubOrg` for loading organizations (#1117 ) This commit adds a module, `fetchGithubOrg`, which loads data on GitHub organizations, most notably including the list of repositories in that org. The structure of this commit is heavily influenced by review feedback from @wchargin's [review] of a related PR. Test plan: This logic depends on actually hitting GitHub's API, so the tests are modeled off the related `fetchGithubRepo` module. There is a new shell test, `src/plugins/github/fetchGithubOrgTest.sh`, which verifies that that the org loading logic works via a snapshot. To verify the correctness of this commit, I've performed the following checks: - `yarn test --full` passes - inspection of `src/plugins/github/example/example-organization.json` confirms that the list of repositories matches the repos for the "sourcecred-test-organization" organization - manually breaking the snapshot (by removing a repo from the snapshot) causes `yarn test --full` to fail - running `src/plugins/github/fetchGithubOrgTest.sh -u` restores the snapshot, afterwhich `yarn test --full` passes again. [review]: https://github.com/sourcecred/sourcecred/pull/1089#pullrequestreview-204577637	2019-03-19 19:00:08 -07:00
Dandelion Mané	bd8be01958	Refactor loadGraph out of exportGraph (#1113 ) This pulls the logic for loading a SourceCred graph from disk out `cli/exportGraph` and into `analysis/loadGraph`. The rationale is that `exportGraph` is not the only command that wants the ability to load a graph from the analysis adapters. The new command has a clean return signature that reports whether the load was successful, or failed because the graph wasn't loaded, or failed due to an error in plugin code. Testing of the loading logic has been moved to `loadGraph.test`, and the CLI has been refactored so that the loadGraph method is dependency injected. This allows for (IMO) cleaner testing of the CLI method. There is one (deliberate) change in behavior, which is that the command no longer throws an error if no plugins are included; instead it will just export an empty graph. I don't have a strong preference between the two behaviors; changing it was just more convenient. Test plan: New unit tests have been added, and tests of the cli command have been re-written. As a sanity check, I've verified that the following sequence still works: ``` $ yarn backend $ node bin/sourcecred.js load sourcecred/pm $ node bin/sourcecred.js export-graph sourcecred/pm ``` Nearly perfect code coverage is maintained. One line is uncovered, and it's the line that injects in the actual graph loading behavior.	2019-03-13 00:24:09 -06:00
Dandelion Mané	d1936fbf93	PagerankGraph: add neighbors + score decomposition (#1094 ) This commit adds a `neighbors` method to `PagerankGraph`. This is an augmented version of `Graph.neighbors`. It returns the base data from `Graph.neighbors` as well as the score, the edge weights, and the score contribution. The score contribution basically means how much score was contributed from the target node by this particular neighbor connection. When the graph is well-converged, a node's score will be the sum of all its neighbors' score contributions, as well as the contribution it received from its synthetic loop edge. So, for completeness sake, I added another method, `syntheticLoopScoreContribution`, which computes how much score a node received from its synthetic loop edge. (This value should usually be close to 0). You can think of these two methods as providing a replacement for the `PagerankNodeDecomposition` logic. Test plan: I've added tests that verify: - That neighbors returns results consistent with Graph.neighbors - That neighbors' score contributions are computed correctly - That neighbors errors if the graph has been modified - That synthetic score contributions are computed correctly - That a node's score is the sum of all its contributions Test plan: Unit tests included. Run `yarn test`.	2019-03-08 15:02:00 -07:00
Dandelion Mané	441d6df255	Move default pagerank settings to pagerankGraph (#1112 ) This commit moves the default Pagerank options out of `analysis/pagerank` and to `core/pagerankGraph`. This reflects the gradual migration of core pagerank logic into `pagerankGraph`. Test plan: `yarn test` should suffice. It's a trivial change.	2019-03-07 23:04:07 -07:00
Ana Noemi	c48b2cd52e	Node and edge description tooltips (#1081 ) * Show tooltips in weightConfig UI * Updated to pass checks from prettier * Updates unit tests to check WeightSlider descriptions * Update CHANGELOG.md to reflect PR #1081	2019-03-07 18:49:27 +09:00
Dandelion Mané	996899ade3	Add CLI command: `sourcecred export-graph` (#1110 ) * Add CLI command: `sourcecred export-graph` This adds an `export-graph` command to the SourceCred CLI. It exports the combined cred graphs for individual RepoIds, as was done for [sourcecred/research#4]. Example usage: ``` $ node bin/sourcecred.js load sourcecred/mission $ node bin/sourcecred.js export-graph sourcecred/mission > /tmp/mission_graph.json ``` Test plan: The new command is thoroughly unit tested. [sourcecred/research#4]: https://github.com/sourcecred/research/pull/4 * Address review feedback by @wchargin	2019-03-01 15:33:40 -07:00
Dandelion Mané	b561b1728b	refactor repoIdRegistry (#1109 ) This commit makes several improvements to `repoIdRegistry`: - Create `writeRegistry` and `getRegistry` methods to abstract over needing to find the right file, populate an empty registry if its not available, etc. - Create `getEntry` for efficiently checking whether a RepoId is in the registry - Rename `addRepoId` to `addEntry` for consistency - Add docstrings The `load` command has been refactored to use the new methods. Test plan: Unit tests added, and they pass. The `load` command is already thoroughly tested, so regressions are very unlikely.	2019-03-01 11:25:10 -07:00
Brian Litwin	b16c374a2b	pagerankGraph: add edge filter (#1105 ) Part of ongoing work for #1020. Test plan: Added tests that mirror the edge filtering tests in `graph.test` to check that `graph` and `pagerankGraph` return the same edges with the given `EdgesOptions` parameter. Also added a sanity check that a `weight` prop is returned from the iterator along with the edge. Given the dependence on a helper function to test the edge iterator's equality between graphs, I would suggest reviewers give particular attention to that function: `expectConsistentEdges()`	2019-02-27 21:44:21 -05:00
Dandelion Mané	8f6a3f30bd	PagerankGraph: Add `totalOutWeight` (#1092 ) This commit adds a `totalOutWeight` method to `PagerankGraph`. For any given node, `totalOutWeight` reports the total weight traveling away from the node on edges (including the synthetic loop edge). Using totalOutWeight makes it possible to normalize the weights to get the actual markov transition probabilities. Test plan: Unit tests verify the following properties: - An error is thrown if the requested node does not exist. - An error is thrown if the graph has been modified. - The out weights are computed correctly in the standard case. - The out weights are computed correctly in the case where there are no weights (except the synthetic loop weight) - The out weights are still computed correctly after JSON-deserialization.	2019-02-22 15:14:38 -07:00
Brian Litwin	bd669f292f	Refactor pagerankGraph's node filter to throw error at call site (#1106 ) Inspired by a [suggestion] @decentralion made to improve #1105 This will enable `pagerankGraph` to throw an error when it is called with invalid option parameters. Previously, to elicit this error we had to access the iterator through `Array.from()` or similar. Test plan: Yarn test passes. Specifically, I removed the `Array.from()` wrapper around `pagerankGraph` in the test that checks to see that `pagerankGraph` throws an error when `nodes()` is passed invalid options. [suggestion]: https://github.com/sourcecred/sourcecred/pull/1105#pullrequestreview-206496537	2019-02-21 19:25:28 -05:00
Brian Litwin	42669cd160	PagerankTable: Replace topLevelFilter with NodeType in props (#1103 ) The motivation for this change is to make it easier to access the selected Node's `name` prop for #576, in which we plan to show a Card displaying summary stats for the selected node. With only the `topLevelFilter` available, it's trickier than it needs to be to find out a node type's `name`. Test Plan: * Yarn test passes. * Visual/Manual inspection of table doesn't surface any issues. * Updated `it("filter defaults to defaultNodeFilter if available")` to `it("selectedNodeType defaults to defaultNodeType if available")`. * Verified that the above new test is failable in several ways by mangling the tests to test for the wrong node type and mangling the code to set the wrong node type. * Since we factored out 'topLevelFilter' and 'defaultNodeFilter', running `git grep -i topLevelFilter` and `git grep -i defaultNodeFilter` turns up empty, just to make sure those terms aren't hanging around to confuse anybody in the future. * I don't think changing the `prop` parameter warrants any additional tests, as the current tests verify that the prop is passed in correctly. This was at @decentralion's suggestion, following the Contributing Guideline's Kent Beck quote of making the easy change to make the change we were originally after (#576) easier. 🙌	2019-02-21 14:27:18 -05:00
Dandelion Mané	c353efff36	Add a test helper function for converged graphs (#1093 ) Really minor refactor, adds a `convergedPagerankGraph` helper method which provides a converged pagerank graph. :) Test plan: `yarn test` suffices.	2019-02-18 13:10:55 -07:00
Brian Litwin	4adbec03c2	Highlight tableRows on :hover and :focus-within (#1059 ) * Highlight tableRows on :hover and :focus-within Resolves #1041 The purpose of this commit is to make the pagerankTable easier to read, as it's currently difficult to distinguish which score is associated with which row because of the tight spacing of the rows and the space between the score column and the row detail column. @wchargin provided the implementation using `linearGradient()` and `backgroundImage`s. A side effect of highlighting the row on `focus-within` is that the rows will become highlighted when the expand button is clicked, which we decided was acceptable. Test plan: Yarn test passes. To test the new highlight behavior, visual/manual inspection passes. Also added the Aphrodite className to the snapshot tests. The combination of testing the className + inline style props should make these tests sensitive to UI changes in the future. Screenshots: <img width="939" alt="screenshot 2019-02-17 15 46 34" src="https://user-images.githubusercontent.com/26695477/52918955-332f5280-32cb-11e9-87d3-887c8877116a.png"> <img width="931" alt="screenshot 2019-02-17 15 45 10" src="https://user-images.githubusercontent.com/26695477/52918953-2f9bcb80-32cb-11e9-9356-82c6dccab4ae.png"> * bump CI	2019-02-18 07:46:04 -05:00
Brian Litwin	23f3f61e1d	Add empty node prefix test case to graph.test (#1091 ) Suggested by @decentralion in his review of #1090 Test plan: yarn test passes. Also verified that the new test case is failable, if you pass in the wrong array of nodes to `expect()` or if you mangle the node filter code.	2019-02-17 19:59:24 -07:00
Brian Litwin	81b7002ce8	Add optional node prefix filter to pagerankGraph (#1090 ) Continuing work on #1020. Adding an optional parameter to `nodes()` which enables optional node prefix filtering. Test plan: @decentralion suggested on Discord that the tests should verify: 1) the parameter was passed to `_graph` correctly 2) the augmentation logic was applied correctly The tests I added are identical to the tests in `graph.test`, except that they verify that the result of `pagerankGraph` matches that of `graph`. On one hand, this creates a dependence on `graph`, as these tests don't verify that the filter works correctly, only that graph has applied the filter and returned the iterator. However, my prevailing thought is that it isn't `pagerankGraph's` responsibility to test the behavior of `graph`, and so testing the exact filter results of `pagerankGraph` like we do in `graph.test` isn't the best strategy, and testing that `pagerankGraph`'s results equal `graph`'s results is a better strategy. The tests also check that a `score` is provided alongside each `node` in the iterator, to minimally satisfy @decentralion's second spec. yarn test passes.	2019-02-17 15:24:10 -05:00
Dandelion Mané	17345fcca9	PagerankGraph: Add toJSON/fromJSON (#1088 ) * PagerankGraph: Add toJSON/fromJSON This commit adds serialization logic to `PagerankGraph`. As with many things in PagerankGraph, it's based on the corresponding logic in `Graph`. Much like graph, it stores data associated with nodes and edges (in this case, the scores and edge weights) in an ordered array rather than a map, so as to avoid repetitiously serializing the node and edge addresses. Test plan: Unit tests added, and they should be sufficient. Also take a look at the included snapshot.	2019-02-16 15:47:38 -07:00
Dandelion Mané	7851c1b007	Add `PagerankGraph.equals` (#1087 ) Part of ongoing work for #1020. Adds an equals method for the PagerankGraph. This is really quite straightforward, the logic is based on the matching logic for `Graph.equals`. Tests added. Test plan: The added tests are comprehensive, and they pass.	2019-02-16 11:52:38 -07:00
Dandelion Mané	7bc0d6956a	Retrieve sorted nodes/edges from GraphJSON (#1015 ) As discussed in #1004, we want to be able to package metadata with a graph's nodes and edges. We can do this much more compactly if we store the metadata as an array, ordered by the corresponding node/edge address, rather than storing a map. The disadvantage is that clients then need to manually sort the graph addresses to deserialize. This commit adds public methods that allow a client to efficiently retrieve the sorted addresses from the GraphJSON (where they are already serialized). This behavior is tested. Note that we appropriately don't allow clients to peek and directly depend on the exact representation of GraphJSON, we just promise that sorted address retrieval is possible. Test plan: Unit tests added (and I verified that breaking the sorting will fail the test).	2019-02-14 12:22:56 -07:00
Dandelion Mané	b51491ce1a	Start work on the PagerankGraph (#1057 ) * Start work on the PagerankGraph This commit begins work on the `PagerankGraph` class, as described in [#1020]. As of this commit, the `PagerankGraph` has basic functionality like retrieving nodes and edges, and running PageRank. However, it is missing utility functionality like equality testing and serialization, and doesn't yet have score decomposition logic. This was mostly produced during a [live coding session]. Thanks to @BrianLitwin, @anthrocypher, and @wchargin for participating. Test plan: The new code is thoroughly unit tested. Please review the test coverage, and also the quality of the documentation. [#1020]: https://github.com/sourcecred/sourcecred/issues/1020 [live coding session]: https://github.com/sourcecred/mission/issues/14 * Improvements from self-review - Don't allow PRG around empty graph, as there's no way to make it a valid probability distribution * Add issue ref in TODOs	2019-02-14 11:24:35 -07:00
Dandelion Mané	dcda8bde1d	Report Markov Chain convergence statistics (#1053 ) This commit modifies `markovChain.findStationaryDistribution` so that in addition to returning the final distribution, it also reports the final convergence delta. This is motivated by the proposed API for the new PagerankGraph (see [#1020]). Also, I think it makes a nice addition to the test code. Note that this slightly changes the output from `findStationaryDistribution`, because we now return the first distribution that is sufficiently converged, rather than that distribution with one additional Markov action. Test plan: Unit tests are updated, and `yarn test` passes. [#1020]: https://github.com/sourcecred/sourcecred/issues/1020 Thanks to @BrianLitwin for semi-pair-programming it Thanks to @wchargin for extensive review feedback.	2019-02-12 19:28:53 -07:00
Dandelion Mané	c428ee01a3	Fill in edge type descriptions (#1083 ) Pull #1080 added in a description field for edge types, but put in a placeholder message for each actual description. This pull adds in descriptions for each edge type. Test plan: `yarn test` passes, and additionally `git grep 'TODO: Add a description for this edge type'` returns no hits. Reviewed by @BrianLitwin and @wchargin.	2019-02-12 18:18:54 -07:00
Dandelion Mané	a56c941b80	Enable loading private git repositories (#1085 ) * Enable loading private git repositories This commit enables loading private repositories, assuming that the user has ssh-agent configured with keys to allow cloning the private repository, and has provided a GitHub API token with permissions for the repository in question. I have not added automated testing. I don't think a cost-benefit analysis favors adding such tests at this time: - This code changes very infrequently, and so is unlikely to break - If it does break, it will be pretty easy to catch and to fix - the @sourcecred org is on a free plan, which doesn't allow private repos, so setting up the test case is a bit of a pain Test plan: `yarn test --full` passes, so I haven't broken existing Git clone behavior. Locally, I am able to load private repositories. * Remove unnecessary process import.	2019-02-11 14:36:14 -07:00
expravit	21d7f09d65	redirect routing for prototype (#1030 ) Fixes #1019. Test plan: Loading the prototype works, as does clicking through to different prototype pages. “Running `git grep -F '/prototypes/'` returns no results; before this commit, it yielded 2 results.”	2019-02-10 14:30:28 -07:00
Ian Darrow	642a62437b	Update WeightSlider.js to allow 0 weights (#1005 ) This commit #811, allowing users to set the weights of node/edge types to 0. The WeightSlider now sets the weight to 0 when its dragged to its minimum value. The logic for converting between weights and sliders has also been made more robust, and is more thoroughly tested. In cases where we wanted to set the weight to 0 (e.g. backwards Reaction edges), the default weight has been changed. Test plan: Loading the UI, check that the sliders still work as expected (dragging them changes the displayed weight, dragging to the far left sets weight to 0). Check that the weights are consumed as expected (setting weight for issues to 0 leads to no cred for issues). Check that the weights for backwards reaction edges now have 0 weight. `git grep "TODO(#811)"` returns no hits.	2019-02-10 13:41:00 -07:00
Dandelion Mané	c6afe5f9d5	Fix a build break due to merge conflicts (#1082 ) PR #1075 added a new EdgeType, and #1080 added a new field to EdgeTypes. Both PRs merged and this broke the build. This very trivial commit fixes the build breakage in a noncontroversial way (copies the placeholder edge description used for every other edge over). Test plan: `yarn test` passes.	2019-02-07 14:27:15 -07:00
Ana Noemi	466d33e4e6	Add descriptions for EdgeTypes (#1080 )	2019-02-07 12:15:32 -07:00
Ana Noemi	6a9be3b7f4	Revert "Add descriptions for EdgeTypes (#1074 )" (#1079 ) This reverts commit `1e78437f71`.	2019-02-01 11:52:35 -08:00
Ana Noemi	1e78437f71	Add descriptions for EdgeTypes (#1074 ) * Add descriptions for EdgeTypes * Displays edge and node description tooltip in weight configuration UI * Update tests	2019-01-30 15:48:22 -05:00
Brian Litwin	0254f54375	Update github example data (#1077 ) Updating github example data with support for 🚀 and 👀 reaction types. This follows #1068 and @decentralion updating the archived repo with the new reaction types. `src/plugins/github/fetchGithubRepoTest.sh -u` (as @decentralion suggested) updated `example-github.json` `yarn unit` caught two tests with failing snapshot tests (`createGraph.test` and `relationalView.test`), so I updated those with `yarn unit -u` `yarn test -full` caught a failing snapshot test at `sharness-full`, resolved by updating the snapshot in `view.json.gz` with `UPDATE_SNAPSHOT=1 yarn test --full`. Thanks to @wchargin for the [explanation] on how to resolve that issue. [explanation]: https://github.com/sourcecred/sourcecred/pull/1077#pullrequestreview-196805017 Test Plan: `yarn test --full` is passing. Additionally, the commands: ```sh filepath="./sharness/__snapshots__/example-github-load/data/sourcecred/example-github/github/view.json.gz" && [ -f "${filepath}" ] && # sanity check diff -u \ <(git show "HEAD:${filepath}" \| gzip -d \| jq .) \ <(gzip -dc "${filepath}" \| jq .) \ ; ``` yields the following output: ``` --- /dev/fd/63 2019-01-27 08:34:15.020387301 -0500 +++ /dev/fd/62 2019-01-27 08:34:15.021095696 -0500 @@ -654,6 +654,22 @@ "subtype": "USER", "login": "decentralion" } + }, + { + "content": "ROCKET", + "user": { + "type": "USERLIKE", + "subtype": "USER", + "login": "decentralion" + } + }, + { + "content": "EYES", + "user": { + "type": "USERLIKE", + "subtype": "USER", + "login": "decentralion" + } } ] } ``` Again, thanks @wchargin's for providing those commands and accompanying explanation.	2019-01-27 13:50:48 -05:00
Brian Litwin	61266cace7	Update reaction types (#1068 ) Resolves #1054 Added "ROCKET" and "EYES" to the list of reaction types. Added "ROCKET" as a valid cred signal, kept "EYES" invisible. My approach was to use `git git grep THUMBS_UP '.js'` and `git grep ThumbsUp '.js'` to find all the relevant files, as suggested in #1054 Test Plan 1) Inspecting Sourcecred/Mission's UI: [#13] contains: GOT 🚀 FROM 1 user @BrianLitwin contains: REACTED 🚀 TO 1 issue @BrianLitwin contains: REACTED 🚀 TO #13 2) Yarn Test passes 3) `github/edges.test` includes a snapshot test to verify that we can create an edge using ROCKET 4) @wchargin also noted that: ```sh diff -u <(git grep -c 'THUMBS_UP' '.js') <(git grep -c 'ROCKET' '.js') diff -u <(git grep -c 'ThumbsUp' '.js') <(git grep -c 'Rocket' '.js') ``` passes. `graphql/mirror.test` now includes "ROCKET" and "EYES" in the example GithubSchema, but their inclusion has no effect on any tests. Screenshots 1. <img width="378" alt="screenshot 2019-01-22 09 02 12" src="https://user-images.githubusercontent.com/26695477/51540428-6c87b600-1e24-11e9-8334-1d9d993dce01.png"> 2. <img width="525" alt="screenshot 2019-01-22 09 02 41" src="https://user-images.githubusercontent.com/26695477/51540472-84f7d080-1e24-11e9-8847-245c0c09ddd6.png"> <br> Shoutout to [this comment], which saved me an untold amount of head-scratching, and also @Decentralion's help debugging in the Issue thread. [#13]: https://github.com/sourcecred/mission/issues/13 [this comment]: `e0762303d4/src/plugins/github/graphqlTypes.test.js (L13-L15)`	2019-01-24 06:24:22 -05:00
Ana Noemi	e0762303d4	Add descriptions for NodeTypes (#1044 ) * Add descriptions for NodeTypes As highlighted by @decentralion in issue #807, we need descriptions for Node and Edge types in the UI to explain to users what each Node and Edge type does. This PR modifies the type definition for `NodeType` and adds a `+description: string` field, then updates all NodeTypes throughout the codebase with descriptions. Test plan: Verify that all tests pass and the descriptions makes sense.	2019-01-21 16:16:56 -08:00
Dandelion Mané	5c2f232017	Expose the Graph's modification count (#1055 ) This commit adds a new `modificationCount` method to `Graph`, which exposes's that graph's modification count. This enables clients to write cached data structures on top of Graph, knowing that they can programatically detect when the cache has been invalidated. Test plan: Unit tests have been addded; `yarn test` passes. This commit is motivated by work on #1020.	2019-01-21 10:08:27 -08:00
Dandelion Mané	e92f247305	Rename "AppAdapter" -> "ExplorerAdapter" (#1052 ) There are two kinds of plugin adapters: adapters for doing cred analysis, called "analysis adapters", and adapters for the cred explorer, which are confusingly called "app adapters". This commit decreases the confusion by renaming app adapters to explorer adapters across the codebase. In a future commit, I will add documentation to the adapter interfaces so that it is clearer to a newcomer to the codebase why these interfaces exist. Thanks to @BrianLitwin, who asked a question during [office hours] that surfaced this issue. [office hours]: https://github.com/sourcecred/mission/issues/12 Test plan: `yarn test` passes, suggests that this rename went off without a hitch. Code review as a sanity check. Also: grepping for `AppAdapter` returns 0 results: ``` $ git grep AppAdapter \| wc -l 0 ``` Note: After producing this commit, I can confirm that the word "adapter" starts to look like utter gibberish after you type it often enough.	2019-01-19 17:02:31 -08:00
Dandelion Mané	2d8a25afc8	Add documentation for `NodeType` and `EdgeType` (#1043 ) This commit adds some docstrings for the concepts of NodeType and EdgeType. I also swapped the order so that `NodeType` comes first, which is more consistent with usage across the codebase. This commit makes no changes to the actual code; the only effects are re-organization and documentation. Test plan: `yarn test` && human inspection	2019-01-16 18:02:39 -08:00
William Chargin	989d1f359f	homepage: remove misleading CSS import (#1045 ) Summary: We use Aphrodite, not CSS imports, for styling. We do have a small `index.css` file that is included during server-side rendering, and is only referenced from `src/homepage/server.js`. But our `index.js` file also has a superfluous `import "./style.css"`, which might suggest that we support CSS imports more generally. This patch removes that import. Thanks to @brianlitwin on Discord for pointing out that this might be confusing. Test Plan: Verified that, under both `yarn start` and `yarn build`, the appearance is the same, and the document still includes a `<style>` element with the contents of `index.css` (which is included by `server.js`). wchargin-branch: remove-css-import	2019-01-09 23:22:20 -08:00
Dandelion Mané	6a010291df	Add documentation to the Graph module (#1025 ) * Add documentation to the Graph module This commit adds a module-level docstring that gives an overview of the Graph class and its importance to SourceCred, as well as adding docstrings to specific methods. Test plan: In addition to review by the SourceCred maintainers, this should be reviewed by at least one person who is not familiar with the codebase, so that we can verify that it's actually working as documentation. :) * Incorporate @wchargin's many suggestions. Test plan: Human review.	2019-01-07 18:35:49 -08:00
Dandelion Mané	24895b3c7d	Make `yarn test` more quiet (#1037 ) This commit adds a new runOption for execDependencyGraph, namely `printVerboseResults`. If this flag is true, then execDependencyGraph will print a "Full Results" section along with the standard error and standard out of every task, regardless of whether it failed or succeeded. (Note, this is the existing behavior for all invocations prior to this commit). If the flag is not true, then execDependencyGraph will not print a full results section, and stdout/stderr will be logged only for tasks that fail. This commit also modifies `yarn test` to use the new flag so that it prints verbose tests only when the `--full` option is provided. This is consistent with our sharness behavior: we print the full sharness logs only when `--full` was provided. This fixes #1035, and ensures that running `yarn test` has a high signal to noise ratio (i.e. it only shows an enumeration of top level tasks). This improves the developer ergonomics of SourceCred by not having a super commonly used and core script spam the user with mostly irrelevant information. Test plan: Run `yarn test` when all tests are passing, and observe that the output has much less noise: ``` yarn run v1.12.3 $ node ./config/test.js tmpdir for backend output: /tmp/sourcecred-test-6337SZ9smvWsWvqE Starting tasks GO ensure-flow-typing GO check-stopships GO check-pretty GO lint GO flow GO unit GO backend PASS check-stopships PASS ensure-flow-typing PASS flow PASS backend GO sharness PASS sharness PASS check-pretty PASS lint PASS unit Overview Final result: SUCCESS Done in 11.66s. ``` Run `yarn test` when there is a real failure (e.g. a unit test failure) and observe that full details on the failure, including the output from stdout/stderr, is still provided. Run `yarn test --full` and observe that full, verbose logs are provided.	2019-01-05 18:16:29 -08:00
Brian Litwin	2b9cef66ed	Add project title to explorer page (#1032 ) Resolves #1027 Using `repoId.owner/repoId.name` for the project title because that is how projects are identified on `PrototypePage`. Created a `<ProjectDetail />` component inside `<App />` that consumes a `RepoId` and renders a title. Test Plan: Added two unit tests: The first verifies that the parent `<App />` component instantiates a `<ProjectDetai />` component with the correct props. The current correct prop is a `RepoId` object. The second test verifies that the `<ProjectDetail />` component renders the title correctly given the `RepoId`, ie as a `<p>` element with `repoId.owner/repoId.name` for text. Visual tests verify that the title is above the Analyze Cred button, and that clicking from one project to another renders the appropriate title for separate projects. Attaching a screenshot as a comment at #1032 for reference: <img width="1253" alt="screenshot 2019-01-04 13 40 03" src="https://user-images.githubusercontent.com/26695477/50706562-34aeff00-102c-11e9-9c1c-6c1e3fa6c415.png">	2019-01-04 12:56:38 -08:00
Dandelion Mané	7c7fa2d83d	Graph: move invariant checker to bottom of class (#1026 ) This moves the invariant checking code from the top of the Graph class to the bottom. Most readers of this file will probably be more interested in seeing the API, and reading the invariant checker first is likely to be confusing and off-putting. Test plan: `yarn test` suffices. No semantic change.	2019-01-03 14:15:40 -08:00
Dandelion Mané	bbe773bb67	Elide assertValid & assertValidParts in production (#1017 ) This commit substantially improves SourceCred's performance in production. Measurement methodology: I create a new tab in Chrome, navigate to my local prototypes, and select go-ipfs. I then turn on profiling, and click the analyze button, and then turn off profiling when analysis is done. I then go to the "bottom-up" tab in the JS analysis box on the bottom and sort by "Total Time". __Before this commit:__ \| fn \| total time \| time as % \| \|:---------------- \| ----------:\| ---------:\| \| assertValid \| 815ms \| 8.6% \| \| assertValidParts \| 261ms \| 2.7% \| __After this commit:__ \| fn \| total time \| time as % \| \|:---------------- \| ----------:\| ---------:\| \| assertValid \| 21ms \| 0.2% \| \| assertValidParts \| 23ms \| 0.3% \| Test plan: `yarn test`, also performance measurement as described above. Fixes #1011.	2018-12-01 17:33:44 -08:00
Dandelion Mané	973a72fe46	Update the blacklisted object ids (#1018 ) This adds a blacklisted id for @greenkeeper, a bot which used to be a user. This is a temporary fix until we solve #998. Test plan: `yarn test` passes. Before this commit, attempting to load `probot/probot` fails. After this commit, it succeeds.	2018-11-28 12:39:19 -08:00
Dandelion Mané	794b93e397	Improve performance of pagerank `decompose` (#1007 ) When I implemented this function, I incorrectly assumed that `lodash.sortBy` only calls subsequent accessor functions if there is a tie from the first accessor. Actually, it calls it every time. We can avoid lots of wasteful JSON.serialization by just grabbing the exact properties of interest. Test plan: For correctness: `yarn test` suffices, as this functionality is already tested. For performance improvement: I ran the full load+analyze workflow, in Chrome, on twbs/bootstrap. Before this change, decompose took 6.9s; after this change, it takes 1.3s, for a 5.3x speedup. Close #943.	2018-11-16 22:26:55 -08:00
William Chargin	80b458d719	core: allow repo ID registry to store metadata (#1003 ) Summary: Our registry was defined to simply be a list of IDs. This is insufficiently flexible; we want to be able to annotate these IDs with, e.g., last-updated times (#989). This commit wraps the entries in a simple object, updating clients appropriately. Test Plan: - Run `node ./bin/sourcecred.js load sourcecred/example-github` with a repository registry in the old format, and note that it errors appropriately. - Run `yarn build` with a repository registry in the old format, and note that it errors (“Compat mismatch”). - Delete the old registry and re-run the `load` command. Note that it runs successfully and outputs a registry. Run `yarn build`; note that this works. - Load data for two repositories. Run `yarn start`. Note that the list of prototypes still works, and that you can navigate to and render attributions for individual project pages. - Verify that `yarn test --full` passes. wchargin-branch: repo-id-registry-metadata	2018-11-09 17:28:39 -08:00
William Chargin	332e776317	deps: upgrade `flow-bin@^0.86.0` (#1002 ) Summary: There have been some breaking changes that require new type annotations, which is a good thing: these prevent `any`-leakage. Test Plan: Run `yarn flow`. wchargin-branch: flow-v0.86.0	2018-11-09 09:24:40 -08:00
William Chargin	0a6eca7d79	link: verify that routes have trailing slash (#1001 ) Summary: This serves as a regression test for #1000. Test Plan: Note that `yarn unit` passes with this patch but fails if the change to the code is reverted, or if the patch in #1000 is reverted. Note that `yarn build` also passes but fails if the patch in #1000 is reverted. Note also that `yarn test --full` passes. wchargin-branch: link-verify-trailing-slash	2018-11-08 20:53:30 -08:00
William Chargin	24c1873dca	site: fix homepage link to prototypes page (#1000 ) Summary: Prior to this commit, clicking the in-copy link to the prototypes page would raise a console error: > Warning: [react-router] Location "/prototype" did not match any routes Test Plan: Run `yarn start` and click the link. wchargin-branch: site-fix-homepage-prototype-link	2018-11-08 20:40:57 -08:00
William Chargin	897ba78d5c	site: fix prototypes page dimensions (#999 ) Summary: Prior to this commit, the prototypes page, which lists just a handful of repositories, was rendered with a vertical scrollbar: you had to scroll 200px to see the version info. This is silly. The `height: 100%` is necessary not to get it to fill up the whole page, but to get it to _not_ fill up ~30 extra pixels. I have no idea why. Test Plan: Run `yarn start` and note that `/prototypes/` now renders without a scrollbar, and with the version info in the bottom-right corner. wchargin-branch: site-fix-prototypes-page-dimensions	2018-11-08 20:39:23 -08:00
Dandelion Mané	8666f9ac1a	cleanup: remove unused field on ScoredConnection (#994 ) This resolves an outstanding TODO in pagerankNodeDecomposition to remove the unused sourceScore field. I have removed it, and it was indeed unused. Test plan: `yarn test` passes.	2018-11-01 18:55:32 -07:00
William Chargin	beccac822f	MapUtil: provide exact output from `toObject` (#993 ) Summary: The `MapUtil` map–object conversion functions used inexact objects for both input and output. They are in fact stronger than that: they can accept arbitrary inexact objects and return arbitrary exact outputs. (Recall that exact objects are subtypes of their inexact counterparts, so this is the maximally permissive combination.) Test Plan: Unit tests added. The “can return an exact object” test fails Flow before this change. The other tests would have passed already. wchargin-branch: maputil-exact-output	2018-11-01 18:55:14 -07:00
Dandelion Mané	252d8d5c99	Move repoIdRegistry to core (#992 ) RepoIdRegistry is used across the project, but not in the explorer. So it makes very little sense that it live in the explorer module. It's now moved to core. Test plan: `yarn test --full` passes	2018-11-01 18:11:48 -07:00
Dandelion Mané	6b8cb66013	Remove cred feedback url configurability (#991 ) We added a configurable cred feedback url on the theory that we would create a separate discourse post to collect feedback for each project in particular. We've now realized that no one is using this, so it's just vestigial complexity now. So I'm removing the logic for configuring the feedback url on a per-project basis. Instead, we will always link to a Google form for collecting feedback. Test plan: `yarn test --full` passes, and I manually checked the links.	2018-11-01 17:43:37 -07:00
Dandelion Mané	29065f44d6	Remove the repository select from explorer/ (#988 ) Historically, a single cred explorer instance could load many different repositories. This turned out to be an anti-feature: we'd rather have a particular url hardlink to exploring the cred for a particular project. This commit removes the repository select from the explorer, and instead mandates that the explorer always has the RepoId passed down from above. Besides providing a better UX, this also greatly simplifies the logic for the explorer, since we no longer have an "initializing state" that doesn't have any RepoId. This builds on the work in #984, and swaps out the old "prototype" page (which has been rendered non-functional by this change) for the new "prototypes" page. Note that it stays at the same route, so links to sourcecred.io/prototype will continue to function. Test plan: Ran `yarn test --full`, and verified that `yarn start` produces a working site.	2018-11-01 16:10:01 -07:00
William Chargin	738853cd02	homepage: render project-specific prototype pages (#984 ) Summary: Currently, we render simply render a placeholder. Soon, we’ll remove the repository selector dropdown from the cred explorer, and render project-specific cred attributions. Test Plan: Run `yarn start`. Navigate to `/prototypes/` and observe: ![Screenshot of `/prototypes/`](https://user-images.githubusercontent.com/4317806/47877810-03227900-ddda-11e8-9a17-28398d83059f.png) Note that the links point to URLs like `/prototypes/sourcecred/example-github`: ![Screenshot of a project page](https://user-images.githubusercontent.com/4317806/47877888-35cc7180-ddda-11e8-95db-9f5099e146a8.png) Then, check that `yarn test --full` passes. wchargin-branch: homepage-project-pages	2018-11-01 15:19:52 -07:00
William Chargin	665bb67e33	homepage: add prototypes listing (#983 ) Test Plan: Apply the following patch: ```diff diff --git a/src/homepage/routeData.js b/src/homepage/routeData.js index 32d3eb65..aac7fc9a 100644 --- a/src/homepage/routeData.js +++ b/src/homepage/routeData.js @@ -38,7 +38,10 @@ const routeData /: $ReadOnlyArray<RouteDatum> / = [ path: "/prototypes/", contents: { type: "PAGE", - component: () => require("./PrototypesPage").default([]), + component: () => + require("./PrototypesPage").default([ + {owner: "sourcecred", name: "example-github"}, + ]), }, title: "SourceCred prototypes", navTitle: null, // for now ``` Then, load <http://localhost:8080/prototypes/> and see that there is an entry in the list, and that it links to <http://localhost:8080/prototypes/sourcecred/example-github/>. Note that clicking the link raises a console error because there is no such route. wchargin-branch: homepage-prototypes-page	2018-11-01 13:30:36 -07:00
Dandelion Mané	a9db2b0919	webpack: expose repo registry at build time (#981 ) Summary: We want to remove the repository selector dropdown on the cred explorer homepage and instead render a separate web page for each project. To do this, we need to know which pages to render statically. We choose to ingest this information from the state of the repository registry at build time. This commit adds an environment variable `REPO_REGISTRY` whose contents are the stringified version of the repository registry, or `null` if SourceCred has been built for the backend. This variable is defined with Webpack’s `DefinePlugin`, so any code bundled by Webpack can refer to it via `process.env.REPO_REGISTRY` both on the server and in the browser. Paired with @wchargin. Test Plan: Sharness tests modified; running `yarn test --full` suffices.	2018-11-01 12:38:18 -07:00
Dandelion Mané	244c01d764	Move plugin choice from explorer to homepage (#979 ) The explorer no longer ships with a set of default plugins. (This made an inappropriate dependency from explorer/ to plugins/, and complicated explorer's contract as a generic component.) Instead, the homepage module is responsible for choosing the plugins to display on the homepage. Test plan: `yarn test --full` passes, and `yarn start` reveals a functioning homepage and prototype.	2018-11-01 11:39:07 -07:00
Dandelion Mané	0ad1e0557f	Move `version.js` to core (#977 ) Currently version is located in `homepage/`, which doesn't make much sense, since it's versioning the whole project. We move it to core. Test plan: `yarn test --full`	2018-11-01 11:33:03 -07:00
William Chargin	64500f53cb	mirror: remove "demo" module (#978 ) Summary: This was used for ad hoc testing of the Mirror module before it was integrated into SourceCred. We haven’t kept it up to date with schema changes, and it is no longer needed: you can just run `sourcecred load`. This was also the only untested code in the `graphql/` package, so it is nice to remove it. Test Plan: Running `yarn test --full` passes. wchargin-branch: remove-mirror-demo	2018-11-01 11:29:57 -07:00
William Chargin	d19227c268	github: use blacklists to unblock twbs/bootstrap (#973 ) Summary: This adds object IDs to the GitHub GraphQL blacklist such that the `twbs/bootstrap` repository can be loaded. Ingesting the Mirror-extracted data into the RelationalView yields the warnings ``` IssueComment[MDEyOklzc3VlQ29tbWVudDEwNTI4Mzk4Ng==].reactions: unexpected null value IssueComment[MDEyOklzc3VlQ29tbWVudDI0NTQ3OTM3OA==].reactions: unexpected null value IssueComment[MDEyOklzc3VlQ29tbWVudDMwNDE4NzIzMg==].reactions: unexpected null value ``` because we have nulled out these `Reaction`s in their enclosing connections. This is expected. Test Plan: Run `yarn backend` and `node ./bin/sourcecred.js load twbs/bootstrap`. Run `yarn start` and note that the cred attribution renders properly. (Loading the GitHub data may take an hour or two. The resulting SQLite3 database is 172MB. Ingesting it into the `RelationalView` still takes just a few seconds, and the cred attribution is rendered quickly.) wchargin-branch: github-use-blacklists	2018-11-01 11:08:17 -07:00
William Chargin	fe50ca83f6	mirror: allow blacklisting objects by ID (#972 ) Summary: This enables us to deal with GraphQL remotes that violate their contract guarantees and provide a node of the wrong type. An instance in which the GitHub GraphQL API does this is documented here: <https://gist.github.com/wchargin/a2b8561b81bcc932c84e493d2485ea8a> A Mirror cache is only valid for a fixed set of blacklisted IDs. This is necessary to ensure consistency. If the set of blacklisted IDs changes, simply remove the existing database and download again from scratch. Test Plan: Unit tests added, with full coverage. wchargin-branch: mirror-blacklist	2018-11-01 11:04:49 -07:00
Dandelion Mané	69989256f6	Rename `app/` to `homepage/` (#974 ) Now that we've moved the explorer out of app, it is more concisely described as the homepage. Test plan: Rename only. Run `yarn test`.	2018-11-01 10:19:51 -07:00
Dandelion Mané	604db14879	Pull `credExplorer` into its own top-level module (#971 ) Currently, the cred explorer is a submodule of `app`. This is somewhat confusing, as `app` is essentially our homepage, and the explorer is a standalone React application which happens to get embedded in our homepage. This commit pulls the explorer from `app/credExplorer/` into `explorer/`, which is a better organization. The `app/adapters` were actually only used by the cred explorer, so those files have been moved to `explorer/adapters`. We should rename them from "App Adapters" to "Explorer Adapters", but I didn't do that in this commit so as to minimize the (already substantial) size of the change. Also, we should rename `app/` to `homepage/` in a subsequent commit. I encountered a nasty Flow bug, which I fixed with help from @wchargin. The result is extra annotations on the demo and fallback dynamic adapters (so that the `static()` method is type annotated). Test plan: This change is massive, but it's just a rename. `yarn test` suffices.	2018-11-01 10:16:42 -07:00
Dandelion Mané	c997f4e1ec	Collect web-related utilities in `webutil/` (#970 ) I'm planning to pull `credExplorer` out of `app` and into its own top-level module. This is a bit awkward, as `credExplorer` depends on a lot of little modules that are currently collected in `app/`. To resolve this, I pull all of these little utility modules into `webutil/`. It's not a totally principled grouping, but it's quite convenient and keeps these rarely changing modules out of the way. Test plan: It's a file move, `yarn test` suffices.	2018-10-31 21:24:25 -07:00
Dandelion Mané	b077bd8179	Move `weightsToEdgeEvaluator` to `analysis` (#969 ) The logic for converting weights into an edge evaluator should not be coupled to the frontend application. Progress towards #967. Test plan: Very straightforward rename; `yarn test` suffices.	2018-10-31 20:14:30 -07:00
Dandelion Mané	48a66c8118	Declare the fallback plugin in analysis (#968 ) Now that the `analysis` module owns the Node and Edge types, it should own the "fallback plugin" too. (Note that it's not actually a plugin, though it somewhat acts like one.) We now declare the fallback type in `analysis`, along with a fallback analysis adapter. `app/adapters` then declares a fallback app adapter. Test plan: `yarn test` Progress towards #967.	2018-10-31 20:02:38 -07:00
Dandelion Mané	f3ddb84cbd	Move weight logic to analysis (#966 ) There's a folder called `app/credExplorer/weights` which contains the type specification for weights (for PageRank configurability), and also contains frontend code for specifying those weights. This commit creates a `weights` module under `analysis` which will contain just the logic for specifying and using the weights, without any frontend consideration. It's mostly a port of the existing logic in `credExplorer/weights`, with the caveat that app adapter related concepts have been removed, in favor of referencing the declaration instead. We then remove the duplicated logic and re-route imports. Test plan: `yarn test`	2018-10-31 13:27:03 -07:00
Dandelion Mané	86a5b532f8	Add the demo plugin (#965 ) * Add the demo plugin This ports the ad-hoc demo adapter defined in `src/app/adapters/demoAdapters.js` into its own demo plugin. This has the benefit that the demo plugin can now be depended on outside the app module, e.g. for the analysis module as well. Correspondingly, I've added a demo analysis adapter. Test plan: `yarn test`. Note that no unit tests were added, as the demo plugin is trivial. * Delete `src/app/adapters/demoAdapters.js` Now that we have an explicit demo plugin at `src/plugins/demo/`, we can remove the legacy declaration of that plugin within the `app` module. This commit deletes the old version, and re-writes all references to point to the standalone plugin. Test plan: `yarn test`	2018-10-31 13:23:46 -07:00
William Chargin	6b789d61d6	github: remove legacy continuations code (#964 ) Summary: It is time. (Replaced with #622.) Test Plan: Running `yarn flow` suffices. Running `yarn test --full` also passes. wchargin-branch: remove-legacy-graphql	2018-10-31 12:45:59 -07:00
William Chargin	2e0b17cef7	mentionsAuthorReference: remove legacy GraphQL dep (#963 ) Summary: This test has data in the old format, and uses the `RelationalView` method that automatically translates it. As we prepare to delete that code, we upgrade the underlying format of this test data. The end code is nicer to read, too (e.g., we don’t need the `connection` helper function). Recommend reviewing with `git show -b`. Test Plan: Running `yarn test` suffices. wchargin-branch: mentionsAuthorReference-remove-legacy-graphql	2018-10-31 12:36:48 -07:00
William Chargin	b77db72c1d	github: remove some deps on github/graphql.js (#962 ) Summary: A number of modules depended on the legacy `github/graphql.js` module solely to get at the `Reactions` enum object. As of #961, that object is exposed from the much lighter-weight `graphqlTypes.js`. This patch switches over the relevant imports, reducing our dependencies on this legacy module and its large bundle size. Test Plan: It suffices to run `yarn flow` and verify that the two values being imported are identical. wchargin-branch: github-use-generated-enums	2018-10-31 12:24:06 -07:00
William Chargin	2377f1980f	schema: generate runtime constants for enum values (#961 ) Summary: We have a `const Reactions` convenience enum in `github/graphql.js`. That value is useful, but that module is slated to die. This commit extends our Flow type generation script to include these values. Test Plan: Existing unit tests suffice. wchargin-branch: schema-generate-enums	2018-10-31 12:22:58 -07:00
Dandelion Mané	9bccb7661d	Change the GitHub default TTL to one week (#960 ) While we wait for explicit configurability, one week is a better default for the many SourceCred demos I maintain. Test plan: n/a	2018-10-31 12:21:08 -07:00
Dandelion Mané	b86b0b32ec	Add `analysisAdapter` (#950 ) For #704, we're adding plugin adapters that are specific to the needs of the analysis module. They have a simple scope: they just provide a way to get the declaration, and to load the corresponding graph. Adapters for the `git` and `github` plugins have been implemented, along with unit tests. Test plan: `yarn test` suffices.	2018-10-31 11:36:30 -07:00
William Chargin	8706fa9771	git: don’t warn when rendering unknown commits (#957 ) Summary: Fixes #953. See that issue for context. Test Plan: Unit tests updated. To see the change in action, load the SourceCred data and expand @decentralion’s commits-authored to find commits that were merged into non-`master` branches. Note that these commits are rendered correctly (in the same way that they were before this patch), and that there is no console error (new as of this patch). ![Screenshot](https://user-images.githubusercontent.com/4317806/47805669-1f98b580-dcf5-11e8-8683-8ee91f7f478a.png) wchargin-branch: git-no-warn-on-unknown-commit	2018-10-31 10:45:25 -07:00
William Chargin	f9bb75ef71	release: v0.2.0 (#952 ) Test Plan: Remove the SourceCred output directory, run `yarn backend`, and load data for `sourcecred/example-github` and `sourcecred/sourcecred`. Then, run `yarn start` and note that the cred explorer still works. Finally, note that `yarn test --full` passes. wchargin-branch: release-v0.2.0	2018-10-30 15:18:19 -07:00
Dandelion Mané	c4e2ec8839	Rename `PluginAdapter` to `AppAdapter` (#948 ) Now that we're planning to add adapters for the `analysis` module, we should rename the `PluginAdapter` to make it clear that it's scoped for `app`. Test plan: `yarn test` suffices	2018-10-30 00:26:22 +00:00
Dandelion Mané	cb30023a02	Factor out plugin declarations (#947 ) The plugin adapters are specific to `app/` and have logic for fetching data from the backend, producing React nodes for descriptions, et cetera. However, they also have information that is generic to the plugin itself: its name, its node/edge prefixes, and its types. This method factors out the generic info into a `PluginDeclaration`, which is a type (rather than an interface). Then, the plugin adapter has a `declaration` method which returns the declaration. Current users of the plugin adapters get additional mechanical complexity because they need to call `.declaration().property` rather than `.property()`, but this is not too significant. The main benefit is that #704 is unblocked, as the cli `analyze` command will be able to get necessary information from the declaration. As an added benefit, the organization of plugin code is somewhat improved. Test plan: `yarn test` sufficies, but I also ran `yarn start` and checked the UI behavior to be safe.	2018-10-30 00:13:11 +00:00
Dandelion Mané	5f2cc56172	Move `{Node,Edge}Type` to src/analysis/types.js (#946 ) Test plan: `yarn test` Part of ongoing work on #704	2018-10-29 23:49:53 +00:00
Dandelion Mané	917b793aca	Move some files from core/attribution/ to analysis/ (#944 ) The `core/attribution/` folder has some code that really is "core" in that it deals with very basic concepts around converting graphs to markov chains and running PageRank on them, and some code that is less "core", like for normalizing scores and doing analysis on them. To make progresson #704, we need an intermediary directory that has analysis-related code that is e.g. aware of Node and Edge types, and weights on those types, and can use them to run weight-informed PageRank. That code shouldn't live in the app directory (since it is not coupled to the frontend rendering), but also shouldn't live in core (since "core" is basically finalized code with fully baked abstractions, and per #710, this is not true of the node/edge type system). Thus, I've decided to create the `analysis` directory. To get that directory started, I've moved the non-core code in `core/attribution/` to `analysis/`. Test plan: `yarn test` passes, which is all we need, since this is a straightforward file rename.	2018-10-29 22:54:15 +00:00
Dandelion Mané	542e2f9723	Add skeleton of `sourcecred analyze` (#942 ) The `analyze` command is the first step towards #704 and #703. When fully implemented, it will run PageRank for a loaded repository, generating a complete graph and cred attribution. For now, this just adds a scaffold. It does basic argument parsing, and has help text, but the actual command is not yet implemented. Test plan: Unit tests verify that the analyze command is hooked into `sourcecred` and `sourcecred help`, and that it responds to the `--help` command and parses its arguments appropriately.	2018-10-29 22:27:06 +00:00
William Chargin	08219f98bf	fetchGithubRepo: use Mirror pipeline (#937 ) Summary: As of this commit, `node ./bin/sourcecred.js load` uses the Mirror code, and the legacy continuation-fetching code is not included in the `sourcecred.js` bundle. We do not yet perform the commit prefetching described in #923. The code should be plenty fast for repositories that merge pull requests at least occasionally. Test Plan: Running `yarn test --full` passes. Loading `sourcecred/sourcecred` works and generates a reasonable credit attribution. Loading it again completes immediately. wchargin-branch: fetchGithubRepo-mirror	2018-10-28 12:03:06 -07:00
William Chargin	e2c99c418b	relationalView: use new data format (#934 ) Summary: This makes significant progress toward #923. As of this commit, it is possible to use the Mirror module for the whole loading pipeline. This process may be slow for repositories that do not use pull requests at all (more precisely, that have large connected commit subgraphs none of whose nodes is the merge commit of a pull request; see #920 for details) so it is not yet the default codepath. Test Plan: Existing unit tests should suffice. For extra testing, I’ve added a script that fetches a repository both via the old continuations logic and the new Mirror logic, then constructs relational views and checks whether the data is the same. For `example-github`, the views are identical. For `sourcecred`, they are not: the old continuations logic erroneously omits two commits, which the Mirror logic includes. You can run the test like this: ``` $ node ./bin/testContinuations.js \ > sourcecred sourcecred MDEwOlJlcG9zaXRvcnkxMjAxNDU1NzA= \ > /tmp/continuations.json /tmp/mirror.json \ > 2> >(jq . >&2) { "child": "0d38dde23a6de831315f3643a7d2bc15e8df7678", "parent": "cb8ba0eaa1abc1f921e7165bb19e29b40723ce65", "type": "UNKNOWN_PARENT_OID" } { "child": "d152f48ce4c2ed1d046bf6ed4f139e7e393ea660", "parent": "de7a8723963d9cd0437ef34f5942a071b850c0e7", "type": "UNKNOWN_PARENT_OID" } Different. Saving to disk... ``` Use `diff -u <(jq . /tmp/continuations.json) <(jq . /tmp/mirror.json)` to inspect the differences, and note that exactly the two missing commits have been added and that there are no other changes. (The diff is small: just 51 lines of nicely formatted JSON.) The full log is here: <https://gist.github.com/wchargin/e159cac9dcf3cc3b1efbd54f59e24e0b> I also generated the `sourcecred/sourcecred` cred attribution and viewed it with `yarn start`, which seems to work fine. wchargin-branch: relationalview-new-data-format	2018-10-23 16:42:49 -07:00
William Chargin	58e98124ac	relationalView: make snapshots order-invariant (#933 ) Summary: An upcoming commit will happen to change the order in which commits are ingested. This is not an observable change, and should not cause a snapshot failure. Test Plan: Inspection. wchargin-branch: relationalview-snapshots-order-invariant	2018-10-23 10:50:18 -07:00
William Chargin	993de9303a	github: translate old format to structured format (#930 ) Summary: This implements the translation module described in #923. See that issue for context. Test Plan: This is a mostly straightforward translation from one strongly typed data structure to another, so Flow handles most of it. As a check on the snapshot, run: ``` $ grep -e oid -e target -e mergeCommit \ > src/plugins/github/__snapshots__/translateContinuations.test.js.snap "target": Object { "oid": "6bd1b4c0b719c22c688a74863be07a699b7b9b34", "oid": "c430bd74455105f77215ece51945094ceeee6c86", "oid": "6d5b3aa31ebb68a06ceb46bbd6cf49b6ccd6f5e6", "oid": "0a223346b4e6dec0127b1e6aa892c4ee0424b66a", "oid": "ec91adb718a6045b492303f00d8e8beb957dc780", "oid": "ecc889dc94cf6da17ae6eab5bb7b7155f577519d", "oid": "ec91adb718a6045b492303f00d8e8beb957dc780", "mergeCommit": Object { "oid": "0a223346b4e6dec0127b1e6aa892c4ee0424b66a", "oid": "ec91adb718a6045b492303f00d8e8beb957dc780", "oid": "ecc889dc94cf6da17ae6eab5bb7b7155f577519d", "oid": "ec91adb718a6045b492303f00d8e8beb957dc780", "mergeCommit": Object { "oid": "6d5b3aa31ebb68a06ceb46bbd6cf49b6ccd6f5e6", "oid": "0a223346b4e6dec0127b1e6aa892c4ee0424b66a", "oid": "ec91adb718a6045b492303f00d8e8beb957dc780", "oid": "ecc889dc94cf6da17ae6eab5bb7b7155f577519d", "oid": "ec91adb718a6045b492303f00d8e8beb957dc780", "mergeCommit": null, ``` Cross-check this against [the example-github commits][commits] thus: - Note that commit `6bd1b4c` is the head commit, and is thus the root commit of the `target` chain. - Note that commits `0a22334` and `6d5b3aa`, which were merged via pull request, appear twice each: once in the history from head, and once as the merge commit of a pull request. - Note that commit `0a22334` has two parents at each occurrence. - Note that the unmerged pull request’s merge commit is `null`. [commits]: https://github.com/sourcecred/example-github/commits/master To run this on real-world data, apply the following patch: ```diff diff --git a/src/plugins/github/fetchGithubRepo.js b/src/plugins/github/fetchGithubRepo.js index 6ac201af..b14ca760 100644 --- a/src/plugins/github/fetchGithubRepo.js +++ b/src/plugins/github/fetchGithubRepo.js @@ -11,6 +11,7 @@ import {stringify, inlineLayout, type Body} from "../../graphql/queries"; import {createQuery, createVariables, postQueryExhaustive} from "./graphql"; import type {GithubResponseJSON} from "./graphql"; import type {RepoId} from "../../core/repoId"; +import translateContinuations from "./translateContinuations"; /** * Scrape data from a GitHub repo using the GitHub API. @@ -44,6 +45,11 @@ export default function fetchGithubRepo( payload ).then((x: GithubResponseJSON) => { ensureNoMorePages(x); + console.warn("Translating continuations..."); + for (const w of translateContinuations(x).warnings) { + console.warn(w); + } + console.warn("Done."); return x; }); } ``` Then run: ``` $ yarn backend >/dev/null 2>/dev/null; echo $? 0 $ node ./bin/sourcecred.js load sourcecred/sourcecred --plugin github 2>&1 \| > ts -s '%.s' 55.015740 Translating continuations... 55.037217 { type: 'UNKNOWN_PARENT_OID', 55.037273 child: '0d38dde23a6de831315f3643a7d2bc15e8df7678', 55.037290 parent: 'cb8ba0eaa1abc1f921e7165bb19e29b40723ce65' } 55.037309 { type: 'UNKNOWN_PARENT_OID', 55.037336 child: 'd152f48ce4c2ed1d046bf6ed4f139e7e393ea660', 55.037359 parent: 'de7a8723963d9cd0437ef34f5942a071b850c0e7' } 55.037383 Done. ``` Note that the two commits in question were each merged into a non-master branch, in #28 and #329 respectively. Note also that translating these continuations took just 22 milliseconds. wchargin-branch: github-translate-continuations	2018-10-22 10:01:49 -07:00
William Chargin	6499df6b6b	github: fix misc. errors in old GraphQL system (#929 ) Summary: This fixes the following issues: - Pull request reviews actually do not have reactions. - We must fetch the `id` of a `Ref`. - We must fetch the `id` of a `Commit`, `Tree`, `Blob`, or `Tag`, and should also fetch its `oid`. - Repository owners cannot be bots. - Commit and reaction authors cannot be bots, organizations, or `undefined`. Test Plan: Running `yarn test --full` passes, and the snapshot diff is clearly correct. wchargin-branch: github-fix-up-continuations	2018-10-22 09:50:24 -07:00
William Chargin	889febb7f6	github: add GraphQL schema and Flow types (#928 ) Summary: The included schema is forked from the one in `graphql/demo.js`. Primitive types have been added, and the `parents` connection has been added to commit objects per #920. (We do not include this in the demo script because without prefetching it would take a long time to load.) Test Plan: Unit tests added; run `yarn unit`. Then run `yarn backend` and verify that `node ./bin/generateGithubGraphqlFlowTypes.js` generates exactly the same output as in the types file: ``` $ node ./bin/generateGithubGraphqlFlowTypes.js \| > diff -u - ./src/plugins/github/graphqlTypes.js $ echo $? 0 ``` Change the `graphqlTypes.js` file and verify that `yarn unit` fails. As the build config has been changed, a `yarn test --full` is warranted. It passes. Finally, I have manually verified that the schema is consistent with the documentation at <https://developer.github.com/v4/object/repository/> and related pages. wchargin-branch: github-schema-flow-types	2018-10-19 09:04:54 -07:00
William Chargin	04f7e9ea8c	graphql: generate Flow types from a schema (#927 ) Summary: Generating Flow types from a structured schema is both straightforward and terribly useful. This commit implements it. Test Plan: Inspect the snapshot for correctness manually. Then, copy it into a new file, remove backslash-escapes, and verify that it passes Flow. A subsequent commit will generate types for the actual GitHub schema. wchargin-branch: graphql-generate-flow-types	2018-10-19 09:02:49 -07:00
William Chargin	e787db53c4	schema: allow annotating primitive field types (#926 ) Summary: Prior to this change, primitive fields were un\[i\]typed. This commit allows adding type annotations. Such annotations do not change the semantics at all, but we will be able to use them to generate Flow types corresponding to a schema. This commit also strengthens the validation on schemata to catch some errors that would previously have gone unnoticed at schema construction time: e.g., a node reference to a type that does not exist. Test Plan: Unit tests updated, retaining full coverage. The demo script continues to work when loading `example-github` or `sourcecred`. wchargin-branch: schema-annotated-primitives	2018-10-19 09:00:52 -07:00
William Chargin	ee4900616b	github: fetch commit parent oids (#924 ) Summary: To ease the transition from manual continuation resolution to the Mirror API, we update the old system to fetch commit parent OIDs, as described in #923. Test Plan: To check that the continuations are wired correctly, apply the following patch to force continuations to be followed at every step: ```diff diff --git a/src/plugins/github/graphql.js b/src/plugins/github/graphql.js index 05761ca..a21a364 100644 --- a/src/plugins/github/graphql.js +++ b/src/plugins/github/graphql.js @@ -53,7 +53,7 @@ const PAGE_SIZE_COMMENTS = 20; const PAGE_SIZE_REVIEWS = 5; const PAGE_SIZE_REVIEW_COMMENTS = 10; const PAGE_SIZE_COMMIT_HISTORY = 100; -const PAGE_SIZE_COMMIT_PARENTS = 5; +const PAGE_SIZE_COMMIT_PARENTS = 0; const PAGE_SIZE_REACTIONS = 5; /** @@ -358,6 +358,7 @@ function* continuationsFromCommit( ) { const b = build; if (result.parents && result.parents.pageInfo.hasNextPage) { + console.warn(result.parents.pageInfo); yield { enclosingNodeType: "COMMIT", enclosingNodeId: nodeId, @@ -366,7 +367,7 @@ function* continuationsFromCommit( b.field( "parents", { - first: b.literal(PAGE_LIMIT), + first: b.literal(1), after: b.literal(result.parents.pageInfo.endCursor), }, [b.fragmentSpread("commitParents")] ``` (It is important that the initial page limit be `0` and not (say) `1`, because the `defaultBranchRef` is likely to have just one parent; by using `0`, we test its continuations as long as it has at least one parent.) Then run `./src/plugins/github/fetchGithubRepoTest.sh` and note that the test passes and the output is ``` { hasNextPage: true, endCursor: null } { hasNextPage: true, endCursor: null } { hasNextPage: true, endCursor: null } { hasNextPage: true, endCursor: null } { hasNextPage: true, endCursor: null } { hasNextPage: true, endCursor: null } { hasNextPage: true, endCursor: null } { hasNextPage: true, endCursor: 'MQ==' } { hasNextPage: true, endCursor: 'MQ==' } ``` Note that the test output (as updated in this commit) includes commits with a unique parent, a commit with no parents, and a commit with two parents. I also ran `node ./bin/sourcecred.js load` on sourcecred/sourcecred and ipfs/js-ipfs. Each worked, and the resulting credit attributions loaded fine. wchargin-branch: github-commit-parent-oids	2018-10-18 19:16:44 -07:00
William Chargin	7c5923959e	github: fetch dates from commits (#919 ) Summary: This has two benefits: - The dates on commits are data that we will probably want when we add timestamps to authorship edges to accommodate time-weighted cred. - Once the mirror module is integrated with the GitHub plugin, we’ll want to fetch dates on commits, because this is the only real-world test case for a nested field that contains a primitive field (as opposed to a node reference), so it’ll be nice to be continually exercising that somewhat-edge case. Date strings are in commit-local time and do not depend on the time zone of the requester (in contrast to [cursors]). For example, on SourceCred: ```shell $ time node ./bin/fetchAndPrintGithubRepo.js \ > sourcecred sourcecred "${GITHUB_TOKEN}" \| > jq -rc ' > .repository.defaultBranchRef.target.history.nodes[] > .author?.date[-6:] > ' \| sort \| uniq -c 1 +03:00 6 -04:00 717 -07:00 58 -08:00 ``` [cursors]: <https://github.com/sourcecred/sourcecred/pull/129#issuecomment-382970474> Test Plan: The snapshot contains 8 instances of `oid` and 8 instances of `date`, which is good (each of these properties appears exactly once on each commit, and nowhere else). Running `yarn test --full` passes. wchargin-branch: github-commit-dates	2018-10-16 22:54:39 -07:00
William Chargin	1155c439b9	mirror: add support for shallow-nested fields (#918 ) Summary: This commit follows up on the previous two pull requests by drawing the rest of the owl. Resolves #915. Test Plan: Unit tests included. To verify the snapshot change, open the snapshot file, and copy everything from `query TestUpdate {` through the matching `}`, not including the enclosing quotes. Strip all backslashes (Jest adds them). Post the resulting query to GitHub and verify that it completes successfully and that the result contains a commit with an `author`. In other words, `xsel -ob \| tr -d '\\' \| ghquery \| jq .` with [ghquery]. [ghquery]: https://gist.github.com/wchargin/630e03e66fa084b7b2297189615326d1 The demo entry point has also been updated. For an end-to-end test, you can run the following command to see a commit with a `null` author (with the current state of the repository) and a commit with a non-`null` author: ``` $ node bin/mirror.js /tmp/mirror-example.db \ > Repository MDEwOlJlcG9zaXRvcnkxMjMyNTUwMDY= \ > 3600 2>/dev/null \| > jq '(.defaultBranchRef.target, .pullRequests[0].mergeCommit) \| {url, author}' { "url": "`6bd1b4c0b7`", "author": { "date": "2018-09-12T19:48:21-07:00", "user": null } } { "url": "`0a223346b4`", "author": { "date": "2018-02-28T00:43:47-08:00", "user": { "id": "MDQ6VXNlcjE0MDAwMjM=", "__typename": "User", "url": "https://github.com/decentralion", "login": "decentralion" } } } ``` You can also check that it is possible to fetch the whole SourceCred repository (ID: `MDEwOlJlcG9zaXRvcnkxMjAxNDU1NzA=`). wchargin-branch: mirror-shallow	2018-10-04 16:00:07 -07:00
William Chargin	49f0803a7a	mirror: reflect nested fields in schema info (#917 ) Summary: See #915 for context. This adds nested field data to the “useful info” data structure added in #857. Test Plan: Unit tests for `_buildSchemaInfo` updated. wchargin-branch: mirror-schemainfo-shallow	2018-10-04 15:30:04 -07:00
William Chargin	3d2206088c	schema: support shallow-nested object fields (#916 ) Summary: See #915 for context. This commit changes the `schema` module only. I had a hard time picking names that clearly distinguish the top-level field on the object and the subfields that it contains. @decentralion and I independently came up with “nest” and “egg”. It’s a bit colorful, but it’s certainly easy to remember which one is which, and it doesn’t conflict with existing notions like “parent”/“child”. Test Plan: Unit tests expanded slightly, retaining full coverage. wchargin-branch: schema-shallow	2018-10-04 15:28:58 -07:00
William Chargin	48b68b221a	link: remove `styles` attribute from child (#911 ) Summary: By using `<a {...this.props}>{children}</a>`, we were forwarding the Aphrodite selectors as `styles`. This caused the static HTML for the page to include `<a styles="[object Object]">`, which is annoying. Test Plan: Unit tests extended: they fail before this change and pass after it. Also clicked a router link and an external link in the application. wchargin-branch: link-child-styles	2018-10-03 12:14:07 -07:00
William Chargin	55950f5354	mirror: add an end-to-end `update` function (#909 ) Summary: This completes the minimum viable public API for the `Mirror` class. As described on the docstring, the typical workflow for a client is: - Invoke the constructor to acquire a `Mirror` instance. - Invoke `registerObject` to register a root object of interest. - Invoke `update` to update all transitive dependencies. - Invoke `extract` to retrieve the data in structured form. It is the third step that is added in this commit. In this commit, we also include a command-line demo of the Mirror module, which exercises exactly the workflow above with a hard-coded GitHub schema. This can be used to test the module’s behavior with real-world data. I plan to remove this entry point once we integrate this module into SourceCred. This commit makes progress toward #622. Test Plan: Unit tests included for the core functionality. The imperative shell does not have automated tests. You can test it as follows. First, run `yarn backend` to build `bin/mirror.js`. Then, run: ```shell $ node bin/mirror.js /tmp/mirror-demo.db \ > Repository MDEwOlJlcG9zaXRvcnkxMjMyNTUwMDY= \ > 60 ``` Here, the big base64 string is the ID for the sourcecred/example-github repository. (This is listed in `graphql/demo.js`, alongside the ID for the SourceCred repository itself.) The value 60 is a TTL in seconds. The database filename is arbitrary. This will print out a ton of output to stderr (all intermediate queries and responses, for debugging purposes), and then the full contents of the example repository to stdout. Run the command again, and it should finish instantly. On my machine, the main function runs faster than the Node startup time (50ms vs 60ms). Then, re-run the command, changing the TTL from `60` to `1`. Note that it sends off some queries and then prints the same repository. It is safe to kill the program at any time, either while waiting for a response from GitHub or while processing the results, because the mirror module takes advantage of SQLite’s transaction-safety. Intermediate updates will be persisted, so you’ll lose just a few seconds of progress. You can also of course dive into the generated database yourself to explore the data. It’s good fun. wchargin-branch: mirror-e2e-update	2018-10-02 21:06:01 -07:00
William Chargin	23e56f481a	mirror: paginate own-data node updates (#908 ) Summary: GitHub has an undocumented node limit on the number of IDs that can be provided to the top-level `nodes` connection. This is silly, because we can just spread the IDs over multiple identical connections. This commit implements the logic to do so. Test Plan: Create some queries that use `nodes(ids: ...)` to fetch varying numbers of objects: ```shell id="MDEwOlJlcG9zaXRvcnkxMjAxNDU1NzA=" nodes() { n="$1" ids="$(yes "$id" \| head -n "$n" \| jq -R . \| jq -sc .)" printf 'nodes(ids: %s) { __typename }' "$ids" } query() { printf '{ ' for num; do printf 'nodes_%s: %s ' "$num" "$(nodes "$num")" done printf '}' } ``` Note that the query given by `query 101` results in an error… ```json { "data": null, "errors": [ { "message": "You may not provide more than 100 node ids; you provided 101.", "type": "ARGUMENT_LIMIT", "path": [ "nodes_101" ], "locations": [ { "line": 1, "column": 3 } ] } ] } ``` …but the query given by `query 98 99` happily returns 197 node entries. wchargin-branch: mirror-paginate-own-data	2018-10-02 20:49:01 -07:00
William Chargin	1b09a7f61b	markdown: ignore references in HTML code elements (#907 ) Summary: Fixes #903. We already ignore Markdown code syntax (backticks), but prior to this commit we treated the contents of all HTML elements, including `<code>`, as normal text. As of this commit, `<code>` elements are stripped entirely. Other HTML elements, like `<em>`, are unaffected. Test Plan: Unit tests added. Also, load data for `ipfs/js-ipfs-block-service`, and observe in the UI that PR `#36` (Update aegir to version 9.0.0) no longer has any outward references. wchargin-branch: markdown-html-code	2018-10-02 20:34:49 -07:00
Dandelion Mané	4a374d755e	Hyperlink Git commits to GitHub (#887 ) This modifies the `nodeDescription` code for the Git plugin so that when given a Git commit, it will hyperlink to that commit on GitHub. It does this by looking up the corresponding `RepoId`s from the newly-added `commitToRepoId` field in the `Repository` (#884). Per a [suggestion in review], rather than hardcoding the GitHub url logic in the Git plugin, we provide them via a `GitGateway`. [suggestion in review]: https://github.com/sourcecred/sourcecred/pull/887#issuecomment-424059649 When no `RepoId` is found, it errors to console and does not include a hyperlink. When multiple `RepoId`s are available, it chooses to link to one arbitrarily. (In the future, we could amend this behavior to add links to every valid repo). This behavior is tested. Test plan: I ran the application on newly-generated data and verified that it sets up commit hyperlinks appropriately. Also, see unit tests.	2018-09-27 20:32:43 -07:00
William Chargin	65d811fb44	mirror: add helpers for full queries and updates (#898 ) Summary: An update step is now roughly as simple as: _updateData(postQuery(_queryFromPlan(_findOutdated()))) (with some option/config parameters thrown in). This makes progress toward #622. Test Plan: Unit tests included. They are light, because these functions are light. They still retain full coverage. wchargin-branch: mirror-full-pipeline	2018-09-27 19:12:47 -07:00
William Chargin	9a4c91887b	mirror: include typenames in extracted data (#897 ) Summary: These typenames are often superfluous, but sometimes they are useful. For instance, we might want to fetch the same data for `User`s, `Bot`s, and `Organization`s, but still differentiate which kind of node we fetched from an `Actor` union reference. Similarly, many timeline events may have similar signatures (like, “issue closed” vs. “issue reopened”). Test Plan: Existing unit tests have been updated; run `yarn unit`. wchargin-branch: mirror-extract-typenames	2018-09-26 21:24:30 -07:00
William Chargin	b74f1f3714	mirror: add public method `extract` (#894 ) Summary: The `extract` method lets you get data out of a mirror in a structured format. The mirror module now contains all the plumbing needed to provide meaningful value. Remaining to be implemented are some internal porcelain and a public method to perform an update step. This makes progress toward #622. Test Plan: Comprehensive unit tests included, with full coverage; run `yarn unit`. wchargin-branch: mirror-extract	2018-09-26 12:04:10 -07:00
William Chargin	90ace93f91	mirror: add helper to find unused table name (#895 ) Summary: Per <https://github.com/sourcecred/sourcecred/pull/894#discussion_r220406780>. Test Plan: Unit tests included; run `yarn unit`. wchargin-branch: mirror-unused-table-name	2018-09-26 11:37:21 -07:00
William Chargin	a5e0707cb9	ui: remove `:visited` color change for nav links (#892 ) Test Plan: Note that the nav links are now a lighter color, except when `:active` (e.g., when you’re holding down the mouse button). wchargin-branch: remove-visited	2018-09-25 16:38:46 -07:00
William Chargin	a7f29cb057	mirror: add connection object types to query plan (#891 ) Summary: This reestablishes harmony in light of #882. Test Plan: Existing unit tests suffice; run `yarn unit`. wchargin-branch: mirror-query-plan-connection-object-typename	2018-09-25 14:25:11 -07:00
William Chargin	892c498f9c	mirror: query and process own-data updates (#883 ) Summary: This commit adds internal functions to (a) emit a GraphQL query to fetch data for own-data of an object, and (b) ingest the results of said query back into the database. The API and implementation differ from the connection-updating analogues introduced in #878 in that the query for own data is independent of an object’s ID: it depends only on the object’s type. This affords us more flexibility in composing queries. As described in a internal documentation comment, values are stored in the database in JSON-stringified form: we cannot use the obvious raw SQL values, because there is no native encoding of booleans (`0`/`1` is conventional), and we need to distinguish them from other data types. There are other ways to solve this problem. Notably: 1. We could take inspiration from OCaml: encode stronger static types and a simpler runtime representation. That is, we could change the schema field types from simply “primitive” to the various specific primitive types. Then, when reading data out from the database, we could reinterpret the values appropriately. 2. We could take advantage of the fact that we are not using all of SQLite’s data types. In particular, we do not store anything as a binary blob, so we could encode `false` as a length-0 zeroblob and `true` as a length-1 zeroblob, for instance. Again, when reading data out from the database, we would reinterpret the values—but in this approach we would not need an explicit schema from the user. For now, we take the easiest and simplest approach just to get ourselves off the ground. We can easily move to the second option described above later. This commit makes progress toward #622. Test Plan: Unit tests included, with full coverage. While these tests check that the GraphQL queries are as expected, they cannot check that they are actually valid in production. To check this, follow the instructions in the added snapshot test. wchargin-branch: mirror-own-data-updates	2018-09-25 11:58:32 -07:00
William Chargin	2af8566e6a	app: add and use a shared Link component (#890 ) Summary: This will enable us to style links consistently across our application. Our previous link colors for base and `:visited` were so similar that I didn’t actually realize that they were different. In this change, I’ve kept the same base color, and selected a more contrasting `:visited` color. I also added an `:active` color, which is good for usability (color chosen via <http://paletton.com/>’s “triad” suggestion). Example screenshot, with active, visited, and base links: ![Screenshot as in this commit][img-underline] I also considered implementing the link underlines with `border-bottom` instead of `text-decoration` (an oft-touted suggestion that has always smelled fishy to me, but [the W3 does it][w3], so I guess it’s okay). That would look like this: ![Screenshot with `border-bottom` underlines][img-border] …but I did not do so ([rationale in a comment on #890][rationale]). [w3]: https://www.w3.org/TR/WCAG20-TECHS/F73.html [img-underline]: https://user-images.githubusercontent.com/4317806/45987381-f154f580-c025-11e8-8b0f-63c1e1ddce02.png [img-border]: https://user-images.githubusercontent.com/4317806/45926176-a081b780-bed4-11e8-96f2-d0d24d11c8f7.png [rationale]: https://github.com/sourcecred/sourcecred/pull/890#issuecomment-424146773 We can certainly change these decisions later—that’s one of the purposes of having this abstraction—so I’m not inclined to bikeshed on them too much in this commit. Implementation adapted from: <`80263b190e/src/components/Link.js`> Test Plan: Check that `<a>` elements and React Router link elements are used only in `Link` and snapshots: ``` $ git grep --name-only -Fw '<a' src/app/Link.js src/assets/logo/discourse_512.png src/plugins/github/__snapshots__/render.test.js.snap $ git grep '"react-router"' \| grep 'Link' src/app/Link.js:import {Link as RouterLink} from "react-router"; src/app/Link.test.js:import {Link as RouterLink} from "react-router"; src/app/createRelativeHistory.test.js:import {Router, Route, Link} from "react-router"; src/app/withAssets.test.js:import {IndexRoute, Link, Router, Route} from "react-router"; ``` Check that the primary color now appears in just one spot: ``` $ git grep -i 0872a2 src/app/Link.js: ...colorAttributes("#0872A2"), ``` Then, run `yarn start` and click all the links. Note in particular that the SVG icons in the header have the correct colors in the active state as well as the base state. wchargin-branch: app-link	2018-09-24 18:26:52 -07:00
William Chargin	3257df63fe	mirror: add helper to register nullable node results (#889 ) Summary: As <https://github.com/sourcecred/sourcecred/pull/883/files#r219648511>. It is somewhat unfortunate that this mixes a command with a query, but the concession is acceptable in this instance, I think. Test Plan: Existing unit tests suffice, retaining full coverage. wchargin-branch: mirror-register-node-field-results	2018-09-24 18:20:28 -07:00
William Chargin	28c4e497fb	mirror: factor out `_makeSingleUpdateFunction` (#888 ) Summary: This helpful utility is already used in some test code, and will shortly be used in main code. @decentralion suggested factoring it out in <https://github.com/sourcecred/sourcecred/pull/883#discussion_r219647781>. Test Plan: Unit tests included, with full coverage; run `yarn unit`. wchargin-branch: mirror-make-update-helper	2018-09-24 18:06:14 -07:00
Dandelion Mané	3b962eacea	Git: track the repositories containing each commit (#884 ) This modifies the Git `Repository` data structure so that for every commit, we track the `RepoId`s of repos containing that commit. This way we will be able to do things like hyperlink to the right url for that commit. `loadRepository` has been modified to set the initial `repoId`. `mergeRepository` has been updated to ensure that it concatenates the `repoId`s properly. Tests were added for both cases. The example-git snapshot has been updated accordingly. Test plan: `yarn test --full`	2018-09-21 17:06:14 -07:00
Dandelion Mané	b506d40efd	Add the `RepoIdString` type (#885 ) This modifies `core/repoId` so that `repoIdToString` returns not a `string`, but an opaque subtype of `string` called `RepoIdString`. This allows us to store stringified `RepoId`s (which is useful whenever we want value-not-reference semantics, like for use as a map key) while still maintaining a type assertion that the strings, in fact, represent valid `RepoId`s. Test plan: A unit test verifies that it's a flow error to cast a string to a `RepoIdString`.	2018-09-21 16:54:20 -07:00
William Chargin	838092194b	mirror: add support for connections of union types (#882 ) Summary: Almost every GitHub connection has nodes of an object type, like `User` or `IssueComment`. But a few have nodes of union type, including `IssueTimelineItemConnection` (which we will likely want to query), and those require special handling. This commit adds susupport for such connections. Analysis to determine which connections have non-object elements: <https://gist.github.com/wchargin/647fa7ed8d6d17ae2e204bd098104407> Test Plan: Unit tests modified appropriately, retaining full coverage. The easiest way to verify the snapshot is probably to copy the raw contents (everything inside the quotes) into `/tmp/snapshot`, then run: ```shell $ sed -e 's/\\//g' </tmp/snapshot >/tmp/query # Jest adds backslashes $ jq -csR '{query: ., variables: {}}' </tmp/query >/tmp/payload $ ENDPOINT='https://api.github.com/graphql' $ AUTH="Authorization: bearer ${SOURCECRED_GITHUB_TOKEN}" $ curl "$ENDPOINT" -X POST -H "$AUTH" -d @- </tmp/payload >/tmp/result ``` and then execute the JQ program mentioned in the comment in the test case, and verify that it prints `true`. wchargin-branch: connection-of-union	2018-09-21 16:20:58 -07:00
Dandelion Mané	23ae7e2f08	Rename the core `Repo` type to `RepoId` (#881 ) We have a core type called `Repo`, but it really is an identifier to a repo, rather than being a repo itself. This is confusing since we have a data type called `Repository` which actually represents the data in a repository. I've renamed `Repo` to `RepoId` for clarity. Test plan: `yarn test --full` passes. Running the frontend passes after wholly regenerating the sourcecred data directory.	2018-09-21 14:44:29 -07:00
Dandelion Mané	1e5f728e29	Cred explorer: display commit short hash + summary (#879 ) This modifies how commits are displayed in the cred explorer. Rather than printing the full hash, we now print a short hash followed by the summary. Test plan: Snapshot is updated, also I tested it by running SourceCred on a real repository.	2018-09-21 13:24:28 -07:00
William Chargin	09ed51ed6e	mirror: align test schema more with GitHub schema (#880 ) Summary: This commit changes the `Issue` type of the schema used in the `mirror` tests to have fields a faithful subset of those in the actual GitHub schema. The tests are self-contained, so this is not strictly required. However, it is convenient, because it means that we can snapshot a query that can actually be posted to GitHub. Test Plan: Running `yarn unit mirror` suffices for the code change. The GitHub schema docs at <https://developer.github.com/v4/object/issue/> indicate that each of `id`, `url`, `author`, `repository`, `title`, and `comments` is a valid field. wchargin-branch: mirror-test-schema	2018-09-20 17:17:27 -07:00
Dandelion Mané	0b3e2e7f7f	Load commits' short hash and summary (#876 ) This modifies the Git Commit type to includea short hash and a oneline summary, and modifies `loadRepository` so that we actually get that data. The example-git repository has been updated to include a commit with leading whitespace and a pipe in the summary, to ensure that these are respected. Test plan: Observe that the snapshot is updated, and the updates are correct. `yarn test --full` passes.	2018-09-20 15:51:58 -07:00
William Chargin	6ae5c56624	mirror: query and process connection updates (#878 ) Summary: This commit adds internal functions to (a) emit a GraphQL query to fetch data for a particular connection, and (b) ingest the results of said query back into the database. This commit makes progress toward #622. Test Plan: Unit tests included, with full coverage. While these tests check that the GraphQL queries are as expected, they cannot check that they are actually valid in production. To check this, follow the instructions in the added snapshot test. wchargin-branch: mirror-connection-updates	2018-09-20 15:46:03 -07:00
Dandelion Mané	5348fe68bf	Remove git adresses for trees, blobs, and entries (#877 ) In #873 I removed the data types for trees, blobs, and entries, but neglected to remove the address related code. This commit corrects that mistake. Some test cases in other modules have been removed because the failure is now structurally impossible, e.g. it is not possible that we would provide a non-commit address to the GitHub plugin, because non-commit addresses do not exist. Test plan: `yarn test --full` passes.	2018-09-20 15:18:01 -07:00
William Chargin	1dd8b7bcb7	mirror: add internal method `_findOutdated` (#875 ) Summary: This function finds all objects whose own data has not been updated since a given time, and all connections whose entries have not been updated since that time. Note that this is scoped to the entirety of the database. In #622, I discussed using a recursive common table expression to identify only those transitive dependencies of the root. I think that this is overkill for the `_findOutdated` method: you’ll usually want to update everything in the database. Don’t worry—the cool recursive query will still be used in the `extract` function. :-) This commit makes progress toward #622. Test Plan: Unit tests added, with full coverage; run `yarn unit`. wchargin-branch: mirror-findoutdated	2018-09-20 14:19:47 -07:00
Dandelion Mané	e3f04c5079	Git plugin serializes the repository and graph (#874 ) This modifies the behavior when loading the Git plugin so that it serializes the Repository as well as the graph. This will allow us to get extra information, like the commit headline, to the Git plugin in the frontend. As an added bonus, we can now refactor `loadRepositoryTest` to depend on `sourcecred.js load` rather than `loadAndPrintRepository`. As this was the only use for `loadAndPrintRepository`, we can safely delete it. This improves our test quality because it means we are also testing the actual CLI behavior. Note that the switch from using `stringify` to `json.tool` for pretty-printing has resulted in a trivial diff in the snapshot. Test plan: `yarn test --full` passes.	2018-09-20 14:07:58 -07:00
Dandelion Mané	5b8ec53b13	Remove all git tree related code (#873 ) In #627, I made a case for removing all trees and blobs from the cred graph. The issue was that the data was bloated and noisy, and did not provide much value in its current form. This commit follows on that by actually removing the code from the codebase (rather than keeping it unused). I want to delete this code because I believe: 1. It is unlikely to see further use in its current form (because collecting the entire Git tree structure is just too noisy for our purposes) 2. In the event that we do need it, reviving it will not be too difficult (because it is all quite locally scoped to the Git plugin). 3. Keeping unused code increases ongoing maintenance + development costs, and I'd like to bias towards keeping the codebase simple and lean. In the event that a future contributor is reviving this code and finds it a pain, I pre-emptively apologize to you. Test plan: `yarn test --full` passees.	2018-09-20 13:50:27 -07:00
William Chargin	ab5b6ecb68	mirror: add public method `registerObject` (#870 ) Summary: This function informs the GraphQL mirror of the existence of an object, specified by its global ID and its concrete typename (“concrete” meaning “object type”—like `User`, not `Actor`). The function will be called extensively internally as more objects are discovered while traversing the graph, but also needs to be exposed as a public entry point: a client needs to call this function at least once to register the root node of interest. A typical client workflow, once all of #622 is implemented, might be: 1. Issue a standalone GraphQL query to find the ID of a root node, like a GitHub repository: `repository(owner: "foo", name: "bar") { id }`. 2. Call `registerObject` with the ID found in the previous step. 3. Instruct the mirror to recursively update all dependencies. 4. Extract data from the mirror. As of this commit, steps (1) and (2) are possible. This commit makes progress toward #622. Test Plan: Unit tests included, with full coverage; run `yarn unit`. wchargin-branch: mirror-registerobject	2018-09-20 13:03:06 -07:00
Dandelion Mané	12aa5b7439	Add `mergeRepository` for merging Git repositories (#871 ) This commit adds a utility method, `mergeRepository`, which can merge multiple Git repository data structures. `loadGitData` has been updated to create a merged repository and then subsequently generate a graph from it. Test plan: New unit tests were added. `yarn test --full` passes. Loading a project and viewing its git data in the cred explorer works.	2018-09-20 12:57:57 -07:00
William Chargin	e572551cd8	mirror: add internal method `_createUpdate` (#868 ) Summary: It’s useful to add this simple function now because the rest of the commits required to implement #622 will want to use it extensively in test code. Actual clients of the API will not need to use it, because the concept of “updates” is an implementation detail: clients will always provide simple timestamps. Test Plan: Unit tests included, with full coverage; run `yarn unit`. wchargin-branch: mirror-createupdate	2018-09-20 11:12:21 -07:00
Dandelion Mané	cdceedef8d	Display urls in the cred explorer (#860 ) This commit modifies the plugin adapter's `nodeDescription` method so that it may return a React node. This enables the GitHub plugin's `nodeDescription` method to include hyperlinks directly to the referenced content on GitHub. This makes examining e.g. comment cred much easier. I've also made two other changes to the descriptions: - Pull requests diffs now color-encode the additions and deletions - Descriptions for comments and reviews no longer include the authors The Git plugin's behavior is unchanged. Test plan: I loaded a large repository in the cred explorer and verified that exploring comments and pulls and issues is much easier. The descriptions are as expected for every category of node. Snapshot tests updated. Fixes #590.	2018-09-20 10:48:05 -07:00
William Chargin	5debae414e	mirror: rename helper `_primitivesTableName` (#861 ) Summary: Some clients want to write const primitivesTableName = _primitivesTableName(typename); which they cannot if the function is also called `primitivesTableName`, due to ECMAScript shadowing semantics. Test Plan: Running `yarn flow` suffices; running `yarn unit` really suffices. wchargin-branch: mirror-rename-primitivestablename	2018-09-19 14:28:26 -07:00
William Chargin	1d18652459	graphql: improve coverage of `queries` module (#859 ) Summary: Each change provides real value, by either testing a plausible happy path that simply was not tested previously, or by adding an `empty`-assertion to a switch against a discriminated union type. Test Plan: For the snapshot change relating to the query formatter, note that Prettier formats the changed portion of the snapshot in the same way, by visiting <https://prettier.io/playground> and setting the parser to "graphql". (Prettier in general agrees with the stringification defined by this module, except for commas and spacing, for which we don’t bother to generate impeccably pretty output.) Run `yarn coverage` and note that the coverage of the whole `graphql` package is 100% on all axes. wchargin-branch: graphql-coverage	2018-09-18 17:57:16 -07:00
William Chargin	85efa811e0	mirror: use `SchemaInfo` in `_initialize` (#858 ) Summary: This simplifies and clarifies the code with no observable change. Test Plan: Existing unit tests suffice; run `yarn unit`. wchargin-branch: mirror-use-schemainfo	2018-09-18 16:30:43 -07:00
William Chargin	e69ff57c58	mirror: precompute some useful schema info (#857 ) Summary: This is mostly useful not for computational efficiency, but for ease of implementation: there end up being multiple places where we want to find (say) the primitive fields on an object, and having to go through the whole iterate-and-switch-and-push process repeatedly is annoying. Test Plan: Unit tests included, with full coverage; run `yarn unit`. wchargin-branch: mirror-schema-info	2018-09-18 16:24:38 -07:00
William Chargin	1b1a1e4d46	mirror: embed GraphQL schema into SQL (#849 ) Summary: This commit augments the `Mirror` constructor to turn the provided GraphQL schema into a SQL schema, with which it initializes the backing database. The schema is roughly as originally described in #622, with some changes (primarily: we omit `WITHOUT ROWID`; we add indexes; we store `total_count` on connections; and we use milliseconds instead of seconds for epoch time). Test Plan: Unit tests included, with full coverage; run `yarn unit`. wchargin-branch: mirror-sql-schema	2018-09-18 13:31:34 -07:00
Dandelion Mané	f03e3c83f2	Fix a crash when a review comment has >5 reactions (#855 ) Our continuation-fetching code failed to properly get continuations for pull request review comments, because it was only asking for more reactions on `"IssueComment"` fragments. This caused the `ensureNoMorePages` function to properly throw an error rather than proceding with incomplete data. This commit fixes the root cause by splitting `continuationsFromComment`into `continuationsFromReviewComment` and `continuationsFromIssueComment`. (Pull and issue comments are both considered 'IssueComment's.) The example-github repository has been updated to include 10 reactions to a single review comment; the example-data was updated in this commit, and all reactions have been loaded. I've also added a `console.error` statement in `ensureNoMorePages`. This only triggers when the program is about to fail, and it's useful for debugging. Test plan: `yarn test --full` passes. Paired with @wchargin	2018-09-18 12:52:05 -07:00
William Chargin	fa81c4eaa9	git: properly load empty repositories (#851 ) Summary: Fixes #850. Test Plan: Regression test added; it fails before the change and passes after it. Also, running `node ./bin/sourcecred.js load wchargin/mt` (which is a GitHub repository with no commits) now successfully loads the repository. (The cred explorer fails to process it, because it tries to normalize across GitHub users, of which there are none, but this is a known limitation and is unrelated.) wchargin-branch: fix-empty-git-repository	2018-09-17 16:36:22 -07:00
William Chargin	a93ad80ebc	mirror: initialize a GraphQL database mirror (#847 ) Summary: This commit introduces the `Mirror` class that will be the centerpiece of the persistent-loading API as described in #622. An instance of this class represents a mirror of a remote GraphQL database, defined by a particular schema. In this commit, we add the construction logic, which includes a safety measure to ensure that the database is used within one version of the code and schema. Test Plan: Unit tests included, with full coverage; run `yarn unit`. wchargin-branch: mirror-class	2018-09-17 13:53:08 -07:00
Dandelion Mané	62d3c180ee	Add GitHub reactions to the graph (#846 ) * Define Reaction edges This adds support to `github/edges` for creating edges representing GitHub reactions. These edges are not actually added to the graph. Test plan: Unit tests * Add GitHub reactions to the graph This commit adds functional support for reactions in SourceCred. Only thumbs-up, heart, and hooray reactions are supported for now, as they are all unambiguously positive; adding support for negative reactions like thumbs-down will require some more thought. The reactions are added to the graph, and new edge types have been added to the UI. Test plan: The `graphView` class has been updated to do invariant checking for the reaction edges, including that the unsupported reaction types like "THUMBS_DOWN" aren't added to the graph. I've tested this feature by downloading data for a large repository (ipfs/go-ipfs). The reaction edges appear and transfer cred reasonably. The edge types are displayed in the weight config appropriately. Builds on #839, #840, and #845.	2018-09-17 13:44:11 -07:00
Dandelion Mané	488c98c3e1	Add tensorflow-gardener to the list of bots (#841 ) Test plan: I verified that the spelling is correct	2018-09-17 13:44:02 -07:00
Dandelion Mané	33d14b9d1a	Define Reaction edges (#845 ) This adds support to `github/edges` for creating edges representing GitHub reactions. These edges are not actually added to the graph. Test plan: Unit tests	2018-09-17 13:35:47 -07:00
William Chargin	e9279bee90	mirror: add a helper function for transactions (#844 ) Summary: In implementing #622, we’ll want to run lots of things inside of transactions. This commit introduces a JavaScript API to do so more easily, properly handling success and failure cases. Test Plan: Unit tests included, with full coverage; run `yarn unit`. wchargin-branch: mirror-transaction-helper	2018-09-17 13:33:10 -07:00
William Chargin	f966ce300f	schema: make `fields` and `clauses` exact (#843 ) Summary: This affords more flexibility to clients, because an exact value can be used in place of an inexact value, but not vice versa. Test Plan: Running `yarn flow` suffices. wchargin-branch: schema-exact-type-fields	2018-09-17 12:07:52 -07:00
Dandelion Mané	1ad2cc0958	Request reactions data from GitHub (#839 ) (#840 ) This commit updates the GitHub graphql query to also fetch reactions. We update the JSON typedefs to include this new information, add continuations from comments, and update existing continuation and query code. Also, I added a safety check when updating comments for issues that was previously unnecessary but is now needed. Test plan: - `yarn test --full` passes. - Setting the page limits to 1 and running on the example-github does not error with unexhausted pages, and loads all the expected reactions. - Running on a larger repository (go-ipfs) works as expected. - I have written dependent code that consumes these reactions in the RelationalView, and works as intended, which suggests that the type signatures are correct.	2018-09-17 11:47:37 -07:00
Dandelion Mané	a2ffdf5ca8	Request reactions data from GitHub (#839 ) This commit updates the GitHub graphql query to also fetch reactions. We update the JSON typedefs to include this new information, add continuations from comments, and update existing continuation and query code. Also, I added a safety check when updating comments for issues that was previously unnecessary but is now needed. Test plan: - `yarn test --full` passes. - Setting the page limits to 1 and running on the example-github does not error with unexhausted pages, and loads all the expected reactions. - Running on a larger repository (go-ipfs) works as expected. - I have written dependent code that consumes these reactions in the RelationalView, and works as intended, which suggests that the type signatures are correct.	2018-09-14 16:10:31 -07:00
Dandelion Mané	aecf64b026	Detect references to commits (#833 ) Now that #832 gave us logic to parse references to commits, we have the RelationalView find and add these references. The actual change is a simple extension of existing reference detection logic. Test plan: Observe that the snapshots are updated with references to commits from the example-github repository. Progress on #815.	2018-09-14 11:56:16 -07:00
Dandelion Mané	c9a0d4b1b8	Modify parseReferences to detect refs to commits (#832 ) We add a new function, `findCommitReferences`, which can find both explicit url references to commits, and commit hashes. Since the commit url includes the commit hash, some extra logic is added to deduplicate them in this instance. Tests verify that this is done properly. Test plan: Unit tests cover the cases of having commit hashes, having commit urls, and having both at once.	2018-09-13 18:21:06 -07:00
William Chargin	4675b84443	graphql: validate well-foundedness of unions (#835 ) Summary: GraphQL unions are required to be unions specifically of object types. They cannot contain primitives or other union types as clauses. This is good: it means that we don’t have to worry about unions that recursively reference each other or themselves. Unions are also required to have at least one clause, but we don’t validate this because it’s not helpful for us. An empty union is perfectly well-defined, if useless, and shouldn’t cause any problems. Relevant portion of the spec: <https://facebook.github.io/graphql/October2016/#sec-Union-type-validation> Test Plan: Unit tests added, retaining full coverage; `yarn unit` suffices. wchargin-branch: graphql-schema-union-validation	2018-09-13 18:11:26 -07:00
William Chargin	7da9ef3a94	graphql: add a schema module (#834 ) Summary: This commit introduces a module for declaratively specifying the schema of a GraphQL database. See `buildGithubSchema` in `schema.test.js` for an example of the API. This makes progress toward #622, though the design has evolved some since its original specification there. Test Plan: Unit tests added, with full coverage; `yarn unit` suffices. wchargin-branch: graphql-schema	2018-09-13 18:02:14 -07:00
Dandelion Mané	7074a9dbd8	Fix the build (#831 ) There was a bad interaction between #830 and #829, wherein they both independently changed the snapshot. So they passed individually, and failed once both merged together. This fixes it. Test plan: `yarn test --full` passes.	2018-09-13 17:23:09 -07:00
Dandelion Mané	ab108653f3	Update example-github (#830 ) I added two new issues. One of them has references to commits, which is relevant for work on #815. Test plan: `yarn test --full` passes.	2018-09-13 15:46:51 -07:00
Dandelion Mané	ab85c9785b	Detect references in commit messages (#829 ) Now that the GitHub plugin knows about commit messages (#828), we can parse those commit messages to find references to other GitHub entities. Fixed a minor typing mistake along the way. Test plan: Observe that a number of references have been detected among the commits in the example GitHub repository. We mistakenly find references to wchargin because we don't have a proper tokenizer. (#481) Progress on #815.	2018-09-13 15:46:39 -07:00
Dandelion Mané	a1af9531ec	Request commit messages from GitHub (#828 ) We could get this information from the Git plugin, but since we want to use this for reference detection, it's much easier to have this follow the same pipeline as all the other GitHub reference detection code. I've updated the relational view to also remove the commit messages when compressing by removing bodies. A unit test was added to check this works as intended. See #815 for tracking. Test plan: `yarn test --full` passes. Snapshot changes are appropriate.	2018-09-13 15:15:21 -07:00
Dandelion Mané	c68cb29769	Add commit authorship to the graph (#826 ) In #824, we loaded every commit in the default branch's history into the GitHub relational view, along with authorship info. This commit actually uses that authorship info to create AUTHORS edges from the commit to the user that authored it (whenever possible). The implementation is quite simple: we just need to yield the commits when we yield all the authored entities, so that we will process their authors and add them to the graph. Also, I updated the invariant declarations in `graphView.js`, and corrected a type signature so that the new invariants would typecheck. Test plan: The snapshot update shows that commits are being added to the graph appropriately. Observe that commits which do not have a valid GitHub user as their author do not correspond to edges in the graph. See [example]. This is basically a solution to #815, but I'll defer closing that issue until I've added a few more features, like reference detection. [example]: `6bd1b4c0b7`	2018-09-13 14:19:37 -07:00
Dandelion Mané	4ad9fcf259	Add commits in the history to the RelationalView (#824 ) This builds on #821 so that every commit in the default ref's history is added as a Commit entity to the GitHub relational view. This means that these commits are also added to the graph by the GitHub plugin. In general, this will have no effect on real graphs, because these commits were already available via the Git plugin. Test plan: Observe that the snapshot changes just correspond to new commits being available to the RelationalView, and correspondingly added to the GitHub graph. `yarn test --full` passes.	2018-09-13 14:00:09 -07:00
Dandelion Mané	05f73f04ef	Change GitHub edge encoding strategy (#825 ) GitHub has a procedure for encoding node addresses into sequences of string "parts", so that we can generate unique edge addresses. Right now, the encoding strategy assumes that when we encode a node address into parts, that node address always starts with the prefix `["sourcecred", "github"]`. However, #816 makes the Git commit address a valid GitHub address, which means that this assumption no longer holds. We could start adding special-cased logic to ensure that we de-serialize Commit addresses properly, but what if we create edges between GitHub entities and other plugins' nodes in the future? It is much cleaner to remove the assumption, and serialize the full node address as parts in the edge address. This makes the GitHub edge addresses somewhat longer, but this is OK for now as we don't ever store those on disk. If, in the future, node/edge address length is a problem, we can investigate more principled and maintainable compression strategies at that time. Test plan: `yarn test --full` passes.	2018-09-13 13:54:49 -07:00
Dandelion Mané	9aca956a64	Ensure that `RelationalView` is not typed as `any` (#823 ) This fixes another instance of the notorious [facebook/flow#6400]. We also fix some type errors that were being masked as a consequence. Test plan: `yarn test` passes. [facebook/flow#6400]: https://github.com/facebook/flow/issues/6400	2018-09-13 12:53:08 -07:00
Dandelion Mané	5c46636611	Cleanup some minor mistakes (#822 ) - the capitalization of the GitObject types was incorrect - removed an outdated TODO Test plan: `yarn test` passes.	2018-09-13 12:52:58 -07:00
Dandelion Mané	2a39bd075d	Load commit authorship from GitHub (#821 ) This adds logic for retrieving every commit in the default branch's history, along with authorship information connecting that commit to a GitHub user (when available). This will allows us to do better cred tracking, especially for projects that don't always use pull requests for merging code. This results in a moderate increase in load time for the GitHub plugin. On my machine, loading SourceCred before this change takes 30s, and after this change it takes 34s. Test plan: Observe that the example-github has been updated with commits and authorship. Also, I ran the query for a larger repository (`sourcecred/sourcecred`) to verify that the continuation logic works.	2018-09-13 11:43:36 -07:00
Dandelion Mané	7d0d4fb2fa	Add GitHub commit entity (#819 ) This adds a `Commit` entity to the GitHub relational view. It has all the standard methods: commits can be retrieved en masse or by particular address, they have a URL and authors, and (de)serialize appropriately. The code for adding pull requests has been modified so that the merge commits are added as commit entities. This does not have any effect on the ultimate graph being created; the same edge is added either way. Test plan: I've extended the standard RelationalView tests to cover the `Commit` entity. The case where the commit has 0 authors is not yet tested, but will be once I add support for getting all of the commits from the example-github (we have one example of a commit that doesn't map to a user). Progress on #815.	2018-09-12 19:44:26 -07:00
Dandelion Mané	3e06c054db	Give GitHub plugin support for Commit addresses (#816 ) The Git plugin owns Commits, but the GitHub plugin also creates commits. This commit reifies that relationship by making a Git commit address a valid GitHub structured address. This is precursor work for #815, which will require adding a commit entity to the GitHub relational view. Also, this commit surfaces and fixes a minor type bug, wherein a map from strings to referent addresses was typed to hold any structured address, rather than just referent addresses. Test plan: The unit tests confirm that serializing/deserializing a Git commit address using the GitHub plugin's methods works as intended. Also, unit tests were added that verify that (de)serializing Git addresses for non-commit objects is still an error.	2018-09-12 19:06:06 -07:00
Dandelion Mané	7dc9449fe7	GitHub: Request commit authorship info (#817 ) This commit pulls the graphql fields to request commit information into a fragment, and requests GitHub authorship information (when available) for that fragment. We don't use that information yet, but we will soon. Progress on #815. Test plan: Observe that the example-github data is updated, so that we now have urls and authorship for commits. Observe that the query has updated, but no downstream code was affected. `yarn test --full` passes.	2018-09-12 18:51:06 -07:00
Dandelion Mané	335441e671	Expose node/edge prefixes publicly (#814 ) Both the GitHub and Git plugins create a `_Prefix` object for nodes and edges, which gives the respective prefixes for different node/edge types. We named it `_Prefix` because we weren't sure if these should be exported. In practice, these have proven quite useful to make generally available, and despite the `_`-naming we expose the objects outside their modules. This change renames `_Prefix` to `Prefix` to reflect the reality that these are used as public consts. Exporting them is safe as both objects are frozen. Test plan: Simple rename, `yarn test` suffices.	2018-09-12 17:18:47 -07:00
Dandelion Mané	70fe677990	Update example GitHub data (#813 ) Generated via `src/plugins/github/fetchGithubRepoTest.sh -u` `yarn test --full` passes. Closes #389. Thanks, @wchargin!	2018-09-12 17:18:37 -07:00
Dandelion Mané	737ed4d8b3	Add `MentionsAuthor` edges to the graph (#808 ) This commit builds on the work in #806, adding the `MentionsAuthorReference`s to the graph. It thus resolves #804. Empirically, the addition of these edges does not change the users' cred distribution much. Consider the results with the following 3 forward weights for the edge (results for ipfs/go-ipfs): \| User \| w=1/32 \| w=1/2 \| w=2 \| \|---------------\|-------:\|-------:\|-------:\| \| whyrusleeping \| 228.04 \| 225.69 \| 223.86 \| \| jbenet \| 102.04 \| 100.26 \| 99.53 \| \| kubuxu \| 66.60 \| 67.80 \| 69.36 \| \| ... \| — \| — \| — \| \| btc \| 22.69 \| 22.29 \| 21.38 \| The small effect on users' cred is not that surprising: the MentionsAuthor references always "shadow" a direct comment->user reference. In principle, the overall cred going to the user should be similar; the difference is that now some more cred flows in between the various comments authored by that user, on the way to the user. (And if those other comments had references, then it flows out from them, etc.) Empirically, the variance on comments' scores seems to increase as a result of having this heuristic, which is great—the fact that all comments had about the same score was a bug, not a feature. Sadly, we don't have good tooling for proper statistical analysis of the effect this is having. We'll want to study the effect of this heuristic more later, as we build tooling and canonical datasets that makes that analysis feasible. We choose to add this heuristic, despite the ambiguous effect on users' cred, because we think it is principled, and adds meaningful structure to the graph. Test plan: The commit is a pretty straightforward generalization of our existing GitHub edge logic. All of the interesting logic was thoroughly tested in the preceding pull, so this commit just tests the integration. Observe that standard (de)serialization of the edge works, that the snapshot is updated with a MentionsAuthor reference edge, and that the graph invariant checker, after update, does not throw errors. Also, I manually tested this change on the ipfs/go-ipfs repo. (It does not require regenerating data.)	2018-09-12 12:55:14 -07:00
Dandelion Mané	91f76393e8	Add logic for finding `MentionsAuthorReference`s (#806 ) A `MentionsAuthorReference` is created when a post mentions a user, and that user has authored at least one post in the same thread. Then there is a `MentionsAuthorReference` from the post to the other posts by that author. For context, see the docstrings in `mentionsAuthorReference.js`, and see #804. Test plan: Thorough unit tests have been added, which test the entire pipeline, from ingesting the data via GitHub's graphql responses, through to detecting the references. Edge cases such as self-reference and multi-reference are tested. Thanks to @wchargin for help writing this commit.	2018-09-11 20:27:31 -07:00
Dandelion Mané	bf1e85d6f4	Add `MapUtil.pushValue` for maps of arrays (#805 ) With some frequency we find ourselves needing to maintain maps whose values are arrays that we append to. `MapUtil.pushValue` is a utility method for these cases. Existing usage in `aggregate.js` has been modified to use the new function. Test plan: Unit tests included.	2018-09-11 15:26:54 -07:00
William Chargin	5fa20ec89e	test: use proper top-level `describe` blocks (#801 ) Summary: Per #800, each test file should start with a `describe` block listing its file path under `src`. Currently, nine of our tests do not do so. Of these, eight had a top-level describe block with the wrong name (either not a filepath or an outdated filepath), while only one short test was missing a top-level describe block altogether. This patch fixes each file to use the correct format. Test Plan: Apply the Sharness test in #802, and note that it fails before this patch but passes after it. wchargin-branch: describe-fix	2018-09-06 20:39:46 -07:00
Dandelion Mané	508fbc5d72	Release 0.1.0 (#799 ) Test plan: I ran `yarn test --full`. I also regenerated data from scratch and manually tested the cred explorer.	2018-09-06 19:06:16 -07:00
Dandelion Mané	bf35bbbbda	Move WeightConfig into PagerankTable (#798 ) Previously, the WeightConfig (and the button that expanded it) were in the credExplorer App. This was a little weird, as there's no reason to play with the weights before you have some Pagerank results to investigate; additionally, it risked confusing new users with a concept that was not yet applicable. Also, the implementation was wonky: the WeightConfig had responsibility for expanding/hiding itself, which gave poor ability to position the button and the WeightConfig separately. Finally, the codepath was untested (vestiges of #604). This commit fixes all three issues: - The WeightConfig and button have moved into PagerankTable - The WeightConfig is now a stateless component, and the parent takes responsibility for deciding when to mount it - Logic for showing/hiding the WeightConfig is now tested.	2018-09-06 18:58:09 -07:00
Dandelion Mané	b632bd6188	Move `WeightConfig` into the `weights` directory (#797 ) Test plan: `yarn test` sufficies for this simple move.	2018-09-06 17:29:15 -07:00
Dandelion Mané	eb065f3634	Make `credExplorer/App` control the WeightedTypes (#796 ) This commit implements a [suggestion] to make `credExplorer/App` a single source of truth on the `WeightedTypes`. As such, both `WeightConfig` and `PluginWeightConfig` have been refactored to be (essentially) stateless components that are controlled from above. I say essentially because `WeightConfig` still has its expanded state, but that will go away soon. Along the way, I've improved testing and added some new invariant checking (e.g. that `PluginWeightConfig` was passed the correct set of weights for its adapter). For the first time, there are now tests for the `WeightConfig` itself! I'm not totally done with the weight re-write, but this seems like a good time to close #604, as the whole logical sequence for setting weights is now tested. Test plan: There are new unit tests. Also, to be sure, I manually tested the UI as well. [suggestion]: https://github.com/sourcecred/sourcecred/pull/792#issuecomment-419234721	2018-09-06 17:17:57 -07:00
Dandelion Mané	ead0157960	Change WeightedTypes to contain maps (#795 ) This will make it easier to re-organize the weight components so that the WeightedTypes have a single source of truth, as described in https://github.com/sourcecred/sourcecred/pull/792#issuecomment-419234721 Test plan: Unit tests suffice.	2018-09-06 16:58:40 -07:00
Dandelion Mané	ad5ea761ea	`credExplorer/App` stores weights, not evaluator (#792 ) This commit refactors `credExplorer/App` so that instead of storing an `EdgeEvaulator` in its state, it stores `WeightedTypes` instead. This has a few benefits: - It's trivial to generate the right default value for `WeightedTypes`, so we no longer allow the variable to be nullable in the state. This simplifies logic, removes an error case, and means that we don't require the `WeightConfig` to mount before the app is usable. - `WeightedTypes` are serializable and can be tested for equality, so they are a better-behaved piece of state - We put off the information-destroying transformation as long as possible - In the future, I think we may want to move the weights/types concept into core, at which point the `WeightedTypes` will directly be consumed by the `core/attribution` module. Test plan: Unit tests are pretty thorough; to be safe, I tested the UI myself.	2018-09-06 15:29:38 -07:00
Dandelion Mané	417265a4d4	Add `MapUtil.merge` for combining maps (#794 ) See the docstring for details. Test plan: Unit tests.	2018-09-06 14:53:49 -07:00
Dandelion Mané	c2920547c0	Refactor PluginWeightConfig (#788 ) This refactors PluginWeightConfig so that it uses the `defaultWeightsForAdapter` method introduced in #787. The refactor is mildly invasive, as we switch the state from being a (mutable) `WeightedTypes` to having a (regular, read-only) `WeightedTypes`. I think this is an improvement in consistency. Test plan: Trivial refactor; unit tests+flow pass.	2018-09-05 17:39:21 -07:00
Dandelion Mané	b40c1d078d	Create a `weights` module with types and utils (#787 ) This commit creates a central `weights` module that defines all of the weight-related types, and provides some utilities for dealing with them. This way users of weight-concepts do not need to depend on a lot of random modules just to get the relevant types. The utility methods are implicitly defined a few places in the codebase: now we can avoid re-writing them, and test them more thoroughly. Test plan: Unit tests pass.	2018-09-05 17:34:29 -07:00
Dandelion Mané	d77c76082d	credExplorer gets adapters from one point (#786 ) Currently, the `credExplorer` uses the `defaultStaticAdapters`, but it imports these adapters in multiple places. If we decide to make the adapters configurable (e.g. when we start supporting more plugins) this will be a problem. This change modifies the cred explorer so that the adapters always come from a prop declaration on the app. Then the adapters are passed into the `state` module's functional entry points, rather than letting `state` depend on the default adapters directly. This change is motivated by the fact that my WeightConfig cleanup can be done more cleanly if the adapters are present as a prop on the App. Test plan: Unit tests are updated. Also, `git grep defaultStaticAdapters` reaveals that the adapters are only consumed once.	2018-09-05 16:10:11 -07:00
Dandelion Mané	f7383bbc90	Add `weightsToEdgeEvaluator` (#785 ) This commit adds `weightsToEdgeEvaluator`, a function for converting weighted node types into an `EdgeEvaluator`. This replaces the `edgeWeights` module (which was untested, and an outmoded API). Test plan: The new `weightsToEdgeEvaluator` method is well-tested. Since `WeightConfig` is still not tested, I manually verified that it still works as anticipated.	2018-09-05 13:40:40 -07:00
William Chargin	3dda4ab35c	test: invoke `yarn backend` only once (#784 ) Summary: Lots of tests need the output of `yarn backend`. Before this commit, they tended to create it themselves. This was slow and wasteful, and also could in principle have race conditions (though in practice usually tended not to). This commit updates tests to respect a `SOURCECRED_BIN` environment variable indicating the path to an existing directory of backend applications. Closes #765. Test Plan: Running `yarn test --full` passes. Prepending `echo run >>/tmp/log &&` to the `backend` script in `package.json` and running `yarn test --full` results in a log file containing only one line, indicating that the script really is run only once. wchargin-branch: deduplicate-backend	2018-09-05 12:47:54 -07:00
Dandelion Mané	bd7b7dec82	Export fallback adapter's node and edge types (#783 ) This is convenient for testing other code, where we may want to directly use the fallback types. One test has been updated in this way. I also changed the names for the fallback adapter's edges to be somewhat more readable. Test plan: Tests improved.	2018-09-05 12:34:32 -07:00
Dandelion Mané	2779770af5	Organize weights by plugin (#773 ) This commit adds PluginWeightConfig, which is responsible for adding all the weights for an individual plugin. The top-level WeightConfig now creates multiple PluginWeightConfigs. It also takes responsibility for hiding the FallbackPlugin. Test plan: The PluginWeightConfig is tested (and fairly simple). The top-level WeightConfig is not yet tested (#604), so I manually tested that the weights in the app still function.	2018-09-05 11:57:20 -07:00
Dandelion Mané	f191995c61	Fail on console errors when testing enzyme (#769 ) `testUtil.configureEnzyme` now additionally asserts, after every test, that `console.error` and `console.warn` were not called. Tests that explicitly expect such calls can still be written by manually re-mocking the relevant console method (and several examples already exist). The code that explicitly specifies this for various enzyme test files has been removed. Test plan: `git grep "not.toHaveBeenCalled"` shows only unrelated usage. `yarn test` passes. Adding a spurious console.warn to a passing test causes it to fail. Fixes #668	2018-09-05 11:12:38 -07:00
William Chargin	5e0833421a	Make `execDependencyGraph` a CommonJS module (#767 ) Summary: This simplifies interfaces everywhere. See also #216, which did the opposite of this as a temporary fix due to a Babel/Webpack interaction that no longer exists as of #766. Test Plan: Note that `node bin/sourcecred.js load sourcecred/example-git` still works (after `yarn backend`). Note that `yarn test` still works. These demonstrate that the module works from both a Webpack context and a Node context. Note that `git grep --name-only execDependencyGraph` yields exactly those files touched in this commit. Note that `yarn test --full` passes. wchargin-branch: commonjs-execDependencyGraph	2018-09-04 22:01:22 -07:00
Dandelion Mané	a133a7252f	Fix a bug that can lock the cred explorer UI (#764 ) Currently, it's possible to lock the cred explorer UI via the following sequence of actions: 1. Press the analyze cred button 2. While it is running, modify the weights After following this repro, the UI is stuck: it will say "loading" forever, and the analyze cred button is disabled. The issue is that loadGraph and runPagerank do not apply their success (or failure) state transitions if they are pre-empted by another state change. If a repo change occurs, that's the right behavior: the repo change puts the state back to `"READY_TO_LOAD_GRAPH"`, which means the UI is ready to re-load, and showing the PageRank results for the wrong repo would be very confusing. However, if an edge evaluator change occurs while loadGraph or runPagerank is happening, the state is left in the "LOADING" status (which means the analyze cred button is disabled). This commit fixes the issue via a refactor: per @wchargin's suggestion, responsibility for the edge evaluator moves from the state module out to `credExplorer/App.js`. This dramatically simplifies the state module, as we no longer need a `Substate` concept: we can simplify the state into a single sequence of states. As of the refactor, the bug is impossible. Test plan: Unit tests have been updated to maintain coverage on the refactored code. I manually tested that the bug no longer repro's, and that the cred explorer UI continues to function. I did not add a new test to protect against regression, because in the new codepath, the bug is basically impossible.	2018-09-04 19:41:37 -07:00
Dandelion Mané	67c443af99	Add default weights for edge types (#762 ) This commit modifies the edge type so that it has a `defaultForwardWeight` and `defaultBackwardWeight`, and these defaults are respected by the `WeightConfig`. I came up with reasonable-seeming defaults for the Git and GitHub plugins; these will undoubtably be more methodically tuned in the future. Test plan: `yarn test` passes, also opening the cred explorer now has the specified default weights in the WeightConfig. (Note that the forward/backward directions are reversed as described in #749.)	2018-09-04 16:00:49 -07:00
Dandelion Mané	c7e5a3b87d	Configure forward/backward edge weights separately (#749 ) This commit introduces a new component, `EdgeTypeConfig`, which is responsible for configuring the weights for a given edge type. The config creates two `WeightSlider`s: one for the forward direction, and one for the backward direction. The `DirectionalitySlider` is no longer used, and is removed. This fixes #596. So as to avoid confusion, we now describe every edge with variables, as in 'α REFERENCES β', and clarify that the weight modifies how cred flows from β to α. This necessitated the creation of an `EdgeWeightSlider`, local to the `EdgeTypeConfig`, which sets up a `WeightSlider` with the necessary greek characters. The EdgeTypeConfig is tested, so this is continuing progress towards solving #604. Test plan: I manually verified that modifying edge weights has the expected effect on cred scores. Also, some new unit tests are included.	2018-09-04 15:37:00 -07:00
Dandelion Mané	4cd45c77fc	Factor out NodeTypeConfig (#763 ) This factors `NodeTypeConfig` out of the `WeightConfig` component. The scope for a `NodeTypeConfig` is that it configures a single node type. Right now it just renders a single `WeightSlider`, but I like factoring out both for consistency with the `EdgeTypeConfig` (see #749) and because I expect we may want to add more complexity later. Test plan: The new component has some tests, also I manually tested the frontend.	2018-09-04 12:52:53 -07:00
Dandelion Mané	44407b5520	Combine loadGraph and runPagerank into one button (#759 ) * StateTransitionMachine.loadGraph reports success Step one towards #586. This will enable us to chain runPagerank after loadGraph only if the load went through successfully. Test plan: Unit tests included. * Add StateTransitionMachine.loadGraphAndRunPagerank This methods combines `loadGraph` and `runPagerank` into one method which internally chains the two method. `runPagerank` is only called if `loadGraph` was successful. Progress on #586. Test plan: The new method has attached unit tests. I implemented the unit tests via mocking, which seemed quite convenient as the method is basically a wrapper for chaining two other function calls. * Combine loadGraph and runPagerank into one button Resolves #586. The new button is called "Analyze cred". Test plan: Unit tests, also I tested it manually.	2018-09-03 14:34:14 -07:00
Dandelion Mané	1135141054	Add StateTransitionMachine.loadGraphAndRunPagerank (#758 ) * StateTransitionMachine.loadGraph reports success Step one towards #586. This will enable us to chain runPagerank after loadGraph only if the load went through successfully. Test plan: Unit tests included. * Add StateTransitionMachine.loadGraphAndRunPagerank This methods combines `loadGraph` and `runPagerank` into one method which internally chains the two method. `runPagerank` is only called if `loadGraph` was successful. Progress on #586. Test plan: The new method has attached unit tests. I implemented the unit tests via mocking, which seemed quite convenient as the method is basically a wrapper for chaining two other function calls.	2018-09-03 14:33:52 -07:00
Dandelion Mané	241905f6d8	StateTransitionMachine.loadGraph reports success (#757 ) Step one towards #586. This will enable us to chain runPagerank after loadGraph only if the load went through successfully. Test plan: Unit tests included.	2018-09-03 14:33:25 -07:00
Dandelion Mané	b92d6138d3	Improve GitHub rate limit error message (#755 ) Fixes #732; see that issue for context. Test plan: The success case still works (verified that loading sourcecred/sourcecred works). I haven't tested the error case, as getting a real RATE_LIMIT_EXCEEDED form GitHub is time-consuming, and has only happened once in practice. I'm pretty confident the code works because it's a simple adaptation of the code that catches other cases.	2018-09-03 14:33:17 -07:00
Dandelion Mané	f3fbb3940b	Remove vestigial GITHUB_DELAY_MS code (#756 ) As of #699, the GITHUB_DELAY_MS code is vestigial. It should be removed. Test plan: After rebuilding the backend, loading a repository still works.	2018-09-03 11:38:27 -07:00
Claire L	5abe16144f	Add Discord link and logo to navbar (#587 ) (#593 ) Summary: To facilitate communication and contribution, the Discord invitation has been linked. Test plan: Visual inspection and manual link clicking	2018-09-02 23:15:36 -07:00
William Chargin	e71264f5cc	Replace `oclif` with `cli` (#744 ) Summary: This commit changes the CLI to use the code in `cli` instead of `oclif`. A subsequent commit will remove the dependency on OClif altogether. Resolves #580. Test Plan: Note that `yarn backend; node bin/sourcecred.js help` works. Note that the documentation in the README is still correct. wchargin-branch: cli-replace-oclif	2018-09-02 16:11:56 -07:00
William Chargin	17172c2d96	cli: implement `load` (#743 ) Summary: This ports the OClif version of `sourcecred load` to the sane CLI system. The functionality is similar, but the interface has been changed a bit (mostly simplifications): - The `SOURCECRED_GITHUB_TOKEN` can only be set by an environment variable, not by a command-line argument. This is standard practice because it is more secure: (a) other users on the same system can see the full command line arguments, but not the environment variables, and (b) it’s easier to accidentally leak a command line (e.g., in CI) than a full environment. - The `SOURCECRED_DIRECTORY` can only be set by an environment variable, not by a command-line argument. This is mostly just to simplify the interface, and also because we don’t really have a good name for the argument: we had previously used `-d`, which is unclear, but `--sourcecred-directory` is a bit redundant, while `--directory` is vague and `--sourcecred-directory` is redundant. This is an easy way out, but we can put the flag for this back in if it becomes a problem. - The `--max-old-space-size` argument has been removed in favor of a fixed value. It’s unlikely that users should need to change it. If we’re blowing an 8GB heap, we should try to not do that instead of increasing the heap. - Loading zero repositories, but specifying an output directory, is now valid. This is the right thing to do, but OClif got in our way in the previous implementation. Test Plan: Unit tests added, with full coverage; run `yarn unit`. To try it out, run `yarn backend`, then `node bin/cli.js load --help` to get started. I also manually tested that the following invocations work (i.e., they complete successfully, and `yarn start` shows good data): - `load sourcecred/sourcecred` - `load sourcecred/example-git{,hub} --output sourcecred/examples` These work even when invoked from a different directory. wchargin-branch: cli-load	2018-09-02 16:07:46 -07:00
William Chargin	d685ebbdd4	cli: add a `common` module for environment vars (#742 ) Summary: This includes environment variables to specify the SourceCred directory and the GitHub token. Parts of this may change once #638 is resolved. Test Plan: Unit tests included, with full coverage; run `yarn unit`. wchargin-branch: cli-common	2018-09-02 16:03:38 -07:00
William Chargin	ff2d4f2fd8	cli: add `main`, `sourcecred`, and `help` (#741 ) Summary: This commit includes a minimal usage of an actual CLI application. It provides the `help` command and no actual functionality. Test Plan: Unit tests added, with full coverage. To see it in action, first run `yarn backend`, then run `node bin/cli.js help`. wchargin-branch: cli-beginnings	2018-09-02 15:53:24 -07:00
William Chargin	4c433d417e	cli: add command infrastructure and test utils (#740 ) Summary: This commit introduces the notion of a `Command`, which is simply a function that takes command-line arguments and interacts with the real world. This infrastructure will enable us to write a well-tested CLI. The `Command` interface is asynchronous because commands like `load` need to block on promise resolution (for loading GitHub and Git data). This is annoying for testing, but does not actually appear to be a problem in practice. Test Plan: Unit tests added. See later commits for real-world usage. wchargin-branch: cli-command-infrastructure	2018-09-02 15:48:47 -07:00
William Chargin	1f4a6395c8	cli: rename existing system from `cli` to `oclif` (#739 ) Summary: Per #580, we aim to remove OClif. To do so, we move the old system to a directory `oclif`, and will create the new system in the now-vacant `cli` directory. Test Plan: Note that `yarn backend` still builds, that `node bin/sourcecred.js` still has `help` and `load`, and that `git grep -wc cli` yields only `yarn.lock:9`. wchargin-branch: rename-cli-to-oclif	2018-09-02 15:44:30 -07:00
Dandelion Mané	931f07de13	Compress the RelationalView by removing post body (#747 ) Our serialized RelationalView can get quite large - in the case of TensorFlow it's over 190MB. This is a problem, as GitHub pages have a hard cap of 100MB on hosted files. As a temporary workaround, this commit introduces a method, `compressByRemovingBody`, which strips away the bodies of every post. In the longer term, we'll need a solution that scales with larger repositories, e.g. sharding the relational view into smaller pieces. Test plan: Unit tests were added. I've manually confirmed that the newly-generated views are smaller (2.1MB vs 3.3MB), and that the frontend continues to function.	2018-09-02 00:16:09 -07:00
William Chargin	7f81337d74	Store GitHub data gzipped at rest (#751 ) Summary: We store the relational view in `view.json.gz` instead of `view.json`, taking advantage of the isomorphic `pako` library for gzip encoding and decoding. Sample space savings (note that post bodies are included; i.e., #747 has not been applied): SAVE OLD (B) NEW (B) REPO 89.7% 25326 2617 sourcecred/example-github 82.9% 3257576 555948 sourcecred/sourcecred 85.2% 11287621 1665884 ipfs/js-ipfs 88.0% 20953425 2520358 gitcoinco/web 84.4% 38196825 5951459 ipfs/go-ipfs 84.9% 205770642 31101452 tensorflow/tensorflow <details> <summary>Script to generate space savings output</summary> ```shell savings() { printf '% 7s % 11s % 11s %s\n' 'SAVE' 'OLD (B)' 'NEW (B)' 'REPO' for repo; do file="${SOURCECRED_DIRECTORY}/data/${repo}/github/view.json.gz" if ! [ -f "${file}" ]; then printf >&2 'warn: no such file %s\n' "${file}" continue fi script="$(sed -e 's/^ //' <<EOF repo = '${repo}' pre_size = $(<"${file}" gzip -dc \| wc -c) post_size = $(<"${file}" wc -c) percentage = '%0.1f%%' % (100 (1 - post_size / pre_size)) p = '% 7s % 11d % 11d %s' % (percentage, pre_size, post_size, repo) print(p) EOF )" python3 -c "${script}" done } ``` </details> Closes #750. Test Plan: Comparing the raw old version with the decompressed new version shows that they are identical: ``` $ <~/tmp/sourcecred/data/sourcecred/example-github/github/view.json \ > shasum -a 256 - 63853b9d3f918274aafacf5198787e18185a61b9c95faf640a1e61f5d11fa19f - $ <~/tmp/sourcecred/data/sourcecred/example-github/github/view.json.gz \ > gzip -dc \| shasum -a 256 63853b9d3f918274aafacf5198787e18185a61b9c95faf640a1e61f5d11fa19f - ``` Additionally, `yarn test --full` passes, and `yarn start` still loads data and runs PageRank properly. wchargin-branch: gzip-relational-view	2018-09-01 10:42:30 -07:00
Dandelion Mané	8009e20e5b	Fix crash on repos with underscores and dots (#738 ) The GitHub regex in urlIdParse.js incorrectly disallowed repo names with underscores and dots. Fixes #721. To mitigate errors like this in the future, code which uses regexes to find owners and repos has been modified to all depend on the same regex pattern. Test plan: Unit tests have been updated to include the failure case (they correctly failed), and then code was updated so that the tests pass again. Also, I manually verified that loading ipfs/js.ipfs.io no longer fails. Paired with @wchargin	2018-08-31 16:18:47 -07:00
Dandelion Mané	84d505ab12	Allow repo names with underscores (#737 ) Such repos exist in practice. Test plan: Unit tests	2018-08-30 19:29:20 -07:00
Dandelion Mané	d8a16a4def	Better handling of log weights (#736 ) This commit isolates all of the log-weight behavior in the weight slider. That slider moves in log space, but the numbers printed and passed around the WeightConfig code are now always in linear-space. This should reduce confusion in the UI and for developers. This commit contains two other improvements: (#588) - Changes the (log space) range on the sliders from ±10 to ±5 - Change the order from slider, weight, name to name, slider, weight, so that there is more visual separation between the name and the weight. Test plan: Changes to the weight slider are tested. Changes to the WeightConfig aren't (#604) so I manually tested the UI.	2018-08-30 19:21:59 -07:00
Dandelion Mané	fc5c9ea589	Create canonical demoAdapters for testing (#735 ) PluginAdapters and Node/Edge types are increasingly fundamental to the cred explorer. Prior to this commit, we had no canonical demo adapters/types, and we would create ad-hoc and messy adapters whenever we needed them. This creates unnecessary repetition and lowers test quality. This commit creates a canonical demo adapter (loosely themed based on the wonderful game [Factorio]) and refactors most existing test cases to use the demo adapters. In particular, the horrible mess of pagerankTable adapters has been removed. [Factorio]: https://www.factorio.com/ I left `aggregate.test.js` untouched because I would have needed to materially re-write the tests to port them over. I added a comment so that if we ever do re-write those tests, we'll use the new demo adapters. Test plan: `yarn test` passes.	2018-08-30 15:25:42 -07:00
Dandelion Mané	761b0f1282	Factor out WeightSlider and DirectionalitySlider (#734 ) This commit factors the weight sliders used for both node and edge weights into a shared WeightSlider component, and factors out the direction slider used for edge weights into a DirectionalitySlider. Both of these components are tested. This is a step towards #604. Test plan: The specific behaviors of the sliders are well tested. Since the weight config as a whole is not tested, I manually verified by messing with the weights that node weights, edge weights, and edge directionality all affects the cred distribution as anticipated.	2018-08-30 13:43:32 -07:00
William Chargin	908dc82f4c	Don’t load trees from Git repositories (#730 ) Summary: We currently load trees and then throw them away later, because we don’t get useful signal from them. We should consider not doing that. This will be faster. Test Plan: ``` $ time node bin/sourcecred.js load tensorflow/tensorflow --plugin git real 0m33.512s user 0m35.196s sys 0m12.489s ``` Also, `yarn test --full` passes. wchargin-branch: git-deforestation	2018-08-29 22:11:19 -07:00
Dandelion Mané	d8556b618f	Add a helpful link to the cred explorer (#727 ) Adds a link titled "what is this?" that points to my gentle introduction to cred. Also, move the feedback link to be next to it and get rid of the prototype disclaimer. Test plan: Visual inspection, also a test was updated.	2018-08-29 19:19:01 -07:00
Dandelion Mané	9dfedd3dfa	Change nav bar styling (#728 ) The nav bar now takes up less vertical space, by virtue of not having a fixed height and not having bottom padding. Test plan: Visual inspection.	2018-08-29 19:16:39 -07:00
William Chargin	e3eb779a92	load: pass context arguments to subprocesses (#724 ) Summary: This fixes a bug where, if the `SOURCECRED_DIRECTORY` environment variable is set to `foo` but the `-d bar` flag is passed, then the repository registry will be written under `foo` but the plugin data will be loaded under `bar`. Test Plan: ``` $ rm -rf /tmp/good /tmp/bad $ SOURCECRED_DIRECTORY=/tmp/bad >/dev/null \ > node bin/sourcecred.js load sourcecred/example-github -d /tmp/good $ [ -d /tmp/bad ]; echo $? $ find /tmp/good /tmp/good /tmp/good/cache /tmp/good/cache/sourcecred /tmp/good/cache/sourcecred/example-github /tmp/good/cache/sourcecred/example-github/github /tmp/good/cache/sourcecred/example-github/git /tmp/good/repositoryRegistry.json /tmp/good/data /tmp/good/data/sourcecred /tmp/good/data/sourcecred/example-github /tmp/good/data/sourcecred/example-github/github /tmp/good/data/sourcecred/example-github/github/view.json /tmp/good/data/sourcecred/example-github/git /tmp/good/data/sourcecred/example-github/git/graph.json ``` wchargin-branch: load-pass-context	2018-08-29 18:42:34 -07:00
Dandelion Mané	ce52368744	Add `@codecov` to the list of bots (#723 ) Test plan: n/a	2018-08-29 17:33:33 -07:00
William Chargin	f2a8205e7b	Add logo variants for CredBot and Discourse (#717 ) Summary: The logos are the same as the original, but with different colors for the field. Test Plan: None. wchargin-branch: credbot-discourse-logos	2018-08-29 15:15:10 -07:00
Dandelion Mané	9e78f26d0a	Separate bots and users in the UI (#720 ) Fixes #696. Test plan: This is basically a config change, so I manually tested it. I ran SourceCred on gitcoinco/web, which has two bots, and verified that the bots are correctly removed from the list of users. Selecting "Bots" in the dropdown filter shows the two bots. Changing the user weight does not affect the bots' scores, and changing the bot weight does affect the bots' scores.	2018-08-29 15:14:42 -07:00
William Chargin	d4202b2304	Add a configurable feedback URL to prototype (#715 ) Summary: We can now set, at build time, a URL to be displayed at the top of the prototype, encouraging users to provide feedback. If the URL is not provided, it defaults to the appropriate topic on the SourceCred Discourse instance. The result looks like this: ![Screenshot of the feedback URL in the prototype][screenshot] [screenshot]: https://user-images.githubusercontent.com/4317806/44814824-a238b380-ab92-11e8-88c8-dfbae27ca496.png Test Plan: Unit tests added to `yarn sharness-full` and `yarn unit`. You can run `yarn start` to see the message with the default URL, or `SOURCECRED_FEEDBACK_URL=http://example.com/ yarn start` to specify a custom URL. wchargin-branch: feedback-url	2018-08-29 15:06:12 -07:00
Dandelion Mané	96d08dc97f	Detect a hardcoded list of bots (#718 ) This commit adds a hardcoded list of known bots. Building on #713, it categorizes those userlikes with the bot subtype. (Note that those users may not be bots in the GitHub ontology - GitHub doesn't actually have a clear record of which userlikes are bots.) Progress towards #696. Test plan: Observe the single snapshot change, which demonstrates that @credbot is now correctly categorized as a bot.	2018-08-29 15:01:48 -07:00
William Chargin	761b5a0875	Allow combining repositories at load time (#711 ) Summary: As a first pass toward support for analyzing whole organizations, we allow loading multiple repositories with `sourcecred load`, combining them into a single relational view and a single Git graph at load time. Test Plan: Run ``` node bin/sourcecred.js \ load \ sourcecred/example-git \ sourcecred/example-github \ sourcecred/sourcecred \ --output sourcecred/examples \ ; ``` and select `sourcecred/examples` from the web view. Filter “Repository” nodes, and note that there are three. Note that loading a single repository without `--output` still works, that loading a single repository with `--output` still works (respecting the alias name), and loading not exactly one repository without `--output` yields an appropriate error message. Note that `yarn sharness-full` still works. wchargin-branch: load-combined	2018-08-29 14:52:26 -07:00
Dandelion Mané	2001d3a699	Update GitHub example data (#716 ) I've added [a post by a bot]. Change generated by running: ```sh src/plugins/github/fetchGithubRepoTest.sh -u ``` Test plan: `yarn travis --full` passes. Note that I properly re-archived the GitHub repository. Closes #714. [a post by a bot]: https://github.com/sourcecred/example-github/issues/6#issuecomment-417104047	2018-08-29 14:24:00 -07:00
Dandelion Mané	dda9c5feff	Subtype GitHub userlikes for Users and Bots (#713 ) Userlikes now have an additional piece of data encoded in their address: whether they are a USER or a BOT. Userlikes are still handled identically by the RelationalView, which cuts down on code duplication. I haven't added ORGANIZATIONs but it will be trivial to do once we're interested in tracking them. Note that this is basically the same as how we treat comments: comments are subtyped to review comments, issue comments, and pull comments. This is the initial step towards solving #696. Test plan: Existing unit tests pass (and caught a few bugs during development!). New test cases were added to the parser. Observe that all the snapshot changes make sense. Note: As of this commit, every GitHub userlike is classified as a user, and the subtypes are not used in the application, so this commit causes no change in observable behavior.	2018-08-29 14:00:22 -07:00
Dandelion Mané	a5c909689a	Users have 1000 cred in aggregate (#709 ) This commit changes the cred normalization algorithm so that the total cred of all GitHub user nodes always sums to 1000. For rationale on the change, see #705. Fixes #705. Note that this introduces a new way for PageRank to fail: if the graph has no GitHub userlike nodes, then PageRank will throw an error when it attempts to normalize. This will result in a message being displayed to the user, and a more helpful error being printed to console. If we need the cred explorer to display graphs that have no userlike nodes, then we can modify the codepath so that it falls back to normalizing based on all nodes instead of on the GitHub userlike nodes specifically. Test plan: There is an included unit test which verifies that the new argument gets threaded through the state properly. But this is mostly a config change, so it's best tested by actually inspecting the cred explorer. I have done so, and can verify that the behavior is as expected: the sum of users' cred now sums to 1000, and e.g. modifying the weight on the repository node doesn't produce drastic changes to cred scores.	2018-08-29 12:20:57 -07:00
Dandelion Mané	5b47c504b9	Implement scoreByConstantTotal (#708 ) This commit adds the logic for computing scores so that the total score, summed across all nodes matching a NodePrefix, is a fixed constant. See #705 for context. Test plan: The logic is quite simple, and adequate unit tests are included. Note to reviewer: There is a spurious whitespace diff in the test file because the tests for the previous test block were not correctly scoped.	2018-08-29 12:03:31 -07:00
Dandelion Mané	3e77f486f2	Stop persisting users' weight choices (#706 ) Storing the user's weights in localStore enables a workflow where a user chooses their preferred weights, and brings those weights with them across projects and contexts. However, this is the wrong workflow: actually, a project chooses its weights, and when a user visits a particular project, they want to sync up with the project's choice. Giving the user the ability to modify the weights and recalculate is still important, so that they can propose improvements to the project maintainer. But implicitly keeping their modified weights, and even bringing them to other projects the user inspects, is counter-productive. This commit removes this dubious feature. (It's a feature we were likely to drop anyway, as it conflicts with #703.) As an added bonus, this code is untested, which means the feature is technical debt—so removing it reduces our technical debt! It also removes at least one known bug. Test plan: There are no tests. I manually verified that the frontend still works, and that it no longer persists weights across refresh.	2018-08-29 11:46:48 -07:00
William Chargin	0c2908dbfb	Retry GitHub queries with exponential backoff (#699 ) Summary: This patch adds independent exponential backoff to each individual GitHub GraphQL query. We remove the fixed `GITHUB_DELAY_MS` delay before each query in favor of this solution, which requires no additional configuration (thus resolving a TODO in the process). We use the NPM module `retry` with its default settings: namely, a maximum of 10 retries with factor-2 backoff starting at 1000ms. Empirically, it seems very unlikely that we should require much more than 2 retries for a query. (See Test Plan for more details.) This is both a short-term unblocker and a good kind of thing to have in the long term. Test Plan: Note that `yarn test --full` passes, including `fetchGithubRepoTest.sh`. Consider manual testing as follows. Add `console.info` statements in `retryGithubFetch`, then load a large repository like TensorFlow, and observe the output: ```shell $ node bin/sourcecred.js load --plugin github tensorflow/tensorflow 2>&1 \| ts -s '%.s' 0.252566 Fetching repo... 0.258422 Trying... 5.203014 Trying... [snip] 1244.521197 Trying... 1254.848044 Will retry (n=1)... 1260.893334 Trying... 1271.547368 Trying... 1282.094735 Will retry (n=1)... 1283.349192 Will retry (n=2)... 1289.188728 Trying... [snip] 1741.026869 Ensuring no more pages... 1742.139978 Creating view... 1752.023697 Stringifying... 1754.697116 Writing... 1754.697772 Done. ``` This took just under half an hour, with 264 queries total, of which: - 225 queries required 0 retries; - 38 queries required exactly 1 retry; - 1 query required exactly 2 retries; and - 0 queries required 3 or more retries. wchargin-branch: github-backoff	2018-08-22 11:37:29 -07:00
William Chargin	3216f5596e	Add `GitState`, `Environment` to the `VersionInfo` (#692 ) Summary: The version number displayed in the application now displays much more specific information. It now lists the Git commit from which the build was constructed, and will identify whether we have accidentally deployed a development instance (which would be slow) or an instance with uncommitted changes (which would be bad). The version information is computed during the initialization of the Webpack config. For development, this means that it is computed when you run `yarn start`, and not updated thenafter. If the stale information presents actual confusion, we would need to backport Webpack 4’s support for runtime values in `DefinePlugin` to Webpack 3 (or upgrade Webpack by a major version). Test Plan: The logic for `GitState` and `Environment` has existing tests. With both a clean tree and a dirty tree, run `yarn start` and build the static site, and check that the resulting versions are correct. wchargin-branch: use-rich-version-types	2018-08-16 13:38:13 -07:00
William Chargin	01071866be	Add `GitState`, `Environment` types to `version` (#691 ) Summary: These types will shortly be added to the global `VersionInfo`. For now, we include the types and validation logic only. Test Plan: Unit tests suffice. wchargin-branch: add-rich-version-types	2018-08-16 13:28:29 -07:00
Dandelion Mané	2d28bd5de4	Re-introduce a simplified git plugin (#685 ) This commit re-introduces the git plugin, now that it has been radically simplified as described in [1]. The new git plugin only has nodes for commits and only has commit has-parent edges. As compared to the version that was removed in #628, this plugin is far leaner. It doesn't bloat the graph (for `sourcecred/sourcecred`, the git plugin data is just 164k), and as such doesn't incur much performance penalty. Re-incorporating the git plugin also brings some tangible benefits. We already had git nodes in the graph, as the GitHub plugin attaches them to pull requests. Without any git plugin, these nodes are displayed as "uknown nodes" with ugly descriptions. Also, including a git plugin, even one that is very minimal, communicates to users that git is a source of information to SourceCred, and that they can expect more from it in the future. Note that this commit breaks backcompat for existing repositories that were locally loaded after #628. As such, it is best to `rm -rf $SOURCECRED_DIRECTORY` and start with fresh data. Also, due to a known bug in the WeightConfig, you should reset your browser's local storage. Test plan: After removing the SourceCred directory and the stale localStorage, the cred explorer nicely displays git commits, and connects them via has_parent edges. The NodeType filter allows filtering to commits as expected, and the WeightConfig shows node and edge weights for the Git plugin's nodes and edges. [1]: https://github.com/sourcecred/sourcecred/issues/627#issuecomment-413435447	2018-08-16 13:20:41 -07:00
Dandelion Mané	024ca3c262	Add `git/minimalPluginAdapter` (#690 ) The minimal git plugin adapter only provides commit nodes and has_parent edges. See #627 for context. I forked this from `git/pluginAdapter.js`, and then deleted the nodeTypes and edgeTypes which are no longer in scope. Test plan: This is a fork of untested "glue" code, and is itself still untested.	2018-08-16 12:13:42 -07:00
Dandelion Mané	a460704ea8	Add `createMinimalGraph` for a tiny git graph (#689 ) This implements the approach suggested in [1]. Instead of forking the git plugin entirely, we'll fork the createGraph method and the pluginAdapter so that we have instances that produce a lightweight git graph. createMinimalGraph is a fork of createGraph that only adds commit nodes and has_parent edges. New unit tests ensure that only the whitelisted nodes and edges appear. Supersedes #683 and #684. Test plan: `yarn test` [1]: https://github.com/sourcecred/sourcecred/issues/627#issuecomment-413623784	2018-08-16 11:38:36 -07:00
Dandelion Mané	ac8b9147c2	Cleanup unnecessary lint suppresions (#682 ) We often construct case statements over union-typed variables, and then in the default case, we use a `(type: empty)` assertion to ensure that failing to account for all the cases results in a flow error. In the past, we created an extra line for this assertion, which required some eslint suppressions. We've realized it's cleaner to inline the type assertion in the runtime error that we throw in these defaults. This code cleans everything to the new style, and removes every existing `// no-unused-expressions` invocation in the codebase. Test plan: `yarn test`	2018-08-16 11:15:00 -07:00
Dandelion Mané	e68fe19487	Rename cred explorer table columns (#680 ) The 'Score' column is renamed to 'Cred' (and its prop is renamed as well). The column which shows how a connection or aggregation contributes to a node's cred, as a percentage, has been rendered nameless. It is pretty self explanatory, and the previous name ("Connection") was meaningless. Test plan: Unit tests, also I inspected the frontend.	2018-08-15 22:22:21 -07:00
Dandelion Mané	cdb76d15a6	Display version string in the app's footer (#677 ) Some CSS magic was required. Also creates `src/app/version.js` for storing the version string. Test plan: Visual inspection of the footer in both Chrome and Firefox, both on a page with very little content (the cred explorer without a repository loaded), and on a page with more than a screen height's of content (the homepage, or cred explorer with a large repository loaded). In all cases, the footer unobtrusively appears in the lower-left hand corner at the bottom of the screen, (after scrolling past all content, if applicable).	2018-08-15 17:34:54 -07:00
William Chargin	c6a43d3b61	Fix iteration order in the rasterizer script (#676 ) Summary: The initial logo checkin in #637 included the 32px raster image, but generated it in the _wrong order_ in the rasterizer script. This commit fixes that heinous bug once and for all. Test Plan: Running `rasterize.sh` does not change the output. wchargin-branch: rasterize-32px	2018-08-15 17:24:38 -07:00
William Chargin	5ac2494586	Load plugin data even when hosted at non-root (#679 ) Summary: This commit approximately completes the implementation of #643.\* Plugin adapters are now provided an `Assets` object at `load` time, which they can use to resolve their plugin-specific API routes. \* “Approximately” because there are some non-essential pieces of legacy code that should be cleaned up. Test Plan: Unit tests modified, but it would be good to also manually test this. Run `./scripts/build_static_site.sh` to build the site to, say, `/tmp/gateway/`. Then spin up a static HTTP server serving `/tmp/` and navigate to `/gateway/` in the browser. Note that the entire application works. wchargin-branch: use-assets-in-PluginAdapters	2018-08-15 17:21:51 -07:00
William Chargin	b252a6b5de	Load repo registry even when hosted at non-root (#678 ) Summary: This commit is the next step in #643. It makes the `RepositorySelect` robust to being hosted at arbitrary gateways by accepting `Assets` and resolving the repository registry API route appropriately. Test Plan: Unit tests modified, but it would be good to also manually test this. Run `./scripts/build_static_site.sh` to build the site to, say, `/tmp/gateway/`. Then spin up a static HTTP server serving `/tmp/` and navigate to `/gateway/` in the browser. Note that you can navigate around the application and load the repository registry on the prototype without any console warnings or errors, although you cannot yet load actual graph data. wchargin-branch: use-assets-in-RepositorySelect	2018-08-15 17:12:24 -07:00
William Chargin	19af47a664	Wire `Assets` into React components (#675 ) Summary: This commit takes the next step toward #643 by exposing `Assets` to our React components at top level. Components will be expected to pass them down as appropriate; this commit does not add any actual uses. Test Plan: Apply the following patch: ```diff diff --git a/src/app/Page.js b/src/app/Page.js index 24c2602..7ac2641 100644 --- a/src/app/Page.js +++ b/src/app/Page.js @@ -24,6 +24,10 @@ export default class Page extends React.Component<{\| <Link to="/" className={css(style.navLink, style.navLinkTitle)}> SourceCred </Link> + <img + alt="fav" + src={this.props.assets.resolve("/favicon.png")} + /> </li> {routeData.map(({navTitle, path}) => NullUtil.map(navTitle, (navTitle) => ( ``` Then, observe that the favicon loads correctly and updates across page loads and refreshes in the following situations: - under `yarn start`; - after building the static site and serving from root; - after building the static site and serving from another gateway. wchargin-branch: use-withAssets	2018-08-15 16:54:53 -07:00
William Chargin	c4ecb979b3	Extract `Assets` from router with `withAssets` (#674 ) Summary: This is the last piece of major infrastructure for #643. It will enable components like `Page` and `CredExplorerApp` to receive `Assets` as a prop. A previous iteration of the same functionality used the new Context API in React v16.3. This did a good job of solving the problem in production code, and was convenient. However, it is currently intractable to test with the current state of Enzyme. It’s plausible that this might improve in the future, so if threading down the props becomes to onerous, we might check in to see how our testing libraries are doing. I expect that the threading should not be too bad, given that we do the same thing with `localStore`, which has worked (as far as I’m aware) without a hitch. Test Plan: Unit tests added; `yarn test` suffices. wchargin-branch: withAssets	2018-08-15 16:44:28 -07:00
William Chargin	ce2867a8d5	Use relative paths for router links (#672 ) Summary: As the next step for #643, this patch enables the app to be rendered at non-root gateways by incorporating the relative-path history implementation developed in #666. The app is not fully functional: our React components do not yet know how to resolve assets, and so fetches of resources like the repository will be against the wrong URLs. Test Plan: - Note that `yarn start` still works. - Run `./scripts/build_static_site.sh` to build the site into, say, `/tmp/gateway`. - Run a static web server from `/tmp/gateway/` and note that (a) the paths listed in the page source are relative, and (b) everything works as intended, with no console messages in either Chrome or Firefox. - Run a static web server from `/tmp/` and navigate to `/gateway/` in the browser. Note that the app loads properly, and that refreshes work (i.e., the `pushState` paths are real paths). Note that the repository registry cannot yet be loaded, and so PageRank cannot be run. wchargin-branch: relative-router	2018-08-15 15:35:12 -07:00
William Chargin	91f0459753	Use relative paths for lexically static assets (#671 ) Summary: This is the first observable step toward #643. Assets whose paths are known as literals at server-side rendering time are now referenced via relative paths. This means that the favicon and JavaScript bundle can be loaded from an arbitrary gateway. The actual bundle code will still only work when loaded from `/`. This commit stands alone so that the enclosing change to the Webpack config can be in as small a change as possible. Test Plan: - Note that `yarn start` still works. - Run `./scripts/build_static_site.sh` to build the site into, say, `/tmp/gateway`. - Run a static web server from `/tmp/gateway/` and note that (a) the paths listed in the page source are relative, and (b) everything works as intended, with no console messages in either Chrome or Firefox. - Run a static web server from `/tmp/` and navigate to `/gateway/` in the browser. Note that the favicon and JavaScript are correctly noted, but that the router raises an error because it is trying to load a non-existent route. (This behavior is unchanged.) wchargin-branch: relative-lexically-static	2018-08-15 15:30:23 -07:00
Dandelion Mané	094582be32	PagerankTable displays aggregated connections Previously, expanding a node would display the individual connections that contributed cred to that node. For nodes with high degree, this was a pretty noisy UI. Now, expanding a node displays "aggregations": for every type of adjacent connection (where type is the union of the edge type and the adjacent node type), we show a summary of the total cred from connections of that type. The result is a much more managable summary view. Naturally, these aggregations can be further expanded to see the individual connections. Closes #502. Test plan: The new behavior is unit tested. You can also launch the cred explorer and experience the UI directly. I have used the new UI a lot, as well as demo'd it to people, and I like it quite a bit.	2018-08-15 15:28:59 -07:00
Dandelion Mané	8df0056f08	Add `aggregationKey` to give aggregations keys For the group of aggregations returned by aggregation operation (e.g. the set of aggregations returned by a call to `flatAggregate`), the keys are unique. Test plan: `yarn test`	2018-08-15 15:28:59 -07:00
Dandelion Mané	bdf4fef2f5	Pull `Badge` out as a shared functional component Test plan: Trivial refactor; `yarn test` suffices	2018-08-15 15:28:59 -07:00
Dandelion Mané	be79e3cb1c	Add some margin on the TableRow score column (#669 ) The TableRow currently has some margin on the left, but not on the right. This is visually unbalanced, especially when expanded so depth>0, as the content on the right is at the very edge of the shaded rectangle. This commit cleans that up a bit! Test plan: Visual inspection (see screenshots in the pull request). I don't think unit tests are necessary for small visual tweaks like this.	2018-08-15 15:10:29 -07:00
William Chargin	621a93851c	Resolve a relative path to the application root (#665 ) Summary: This is necessary for #643. If we’re serving `/prototype/index.html`, we need to to use `..` to refer to the root of the site. This patch adds `rootFromPath`, which performs the relevant transformation. (The implementation is trivial, but figuring out exactly what the specification should be was not!) Test Plan: Unit tests added; `yarn test` suffices. wchargin-branch: rootFromPath	2018-08-15 13:03:19 -07:00
William Chargin	c1997d041f	Add an `Assets` resolver (#664 ) Summary: This will enable clients to obtain the path to a static asset, even when the app is not hosted at the root of a server, as outlined in #643. This module will be used for simple assets (images, etc.) and API data (fetches from `/api/**`) alike. This supersedes #663. It includes the logic from that PR (`Assets`) without the React-specific context bindings (`AssetsContext`). Test Plan: Unit tests included; `yarn test` suffices. wchargin-branch: assets-resolver	2018-08-15 12:46:09 -07:00
William Chargin	ad0e98ac2c	Add `createRelativeHistory` history implementation (#666 ) Summary: See #643 and the module docstring on `createRelativeHistory.js` for context and explanation. This patch adds `history@^3.0.0` as an explicit dependency—previously, we were depending on it only implicitly through `react-router` (which was fine then, but is not now). The dependency is chosen to match the version specified in `react-router`’s `package.json`. Test Plan: Extensive unit tests included, with full coverage; `yarn test` suffices. wchargin-branch: createRelativeHistory	2018-08-15 12:01:27 -07:00
Dandelion Mané	00bc9a9461	Add pluralized node names (#648 ) This is preparation for #502 - we want to be able to describe groups of nodes, e.g. "52 repositories", so we need a plural form for every node name. As a fly-by fix, I removed the parentheses around the node names in the fallback adapter, as these proved to look ugly/inconsistent in the UI. Test plan: `yarn test` is sufficient.	2018-08-15 11:58:12 -07:00
Dandelion Mané	f100cd02cc	PagerankTable filter defaults to GitHub users (#653 ) We humans tend to find information about humans more interesting than information about commits or pulls. The UI should accomodate this by defaulting to displaying GitHub user nodes in the cred explorer. This is implemented as a new nullable argument to the PageRankTable. If not present, then the filter defaults to showing all nodes. If the default filter is present but doesn't match any available type, an error is thrown. Test plan: The new behavior is tested. Also, I checked it in the UI and it works. Closes #651	2018-08-15 11:44:44 -07:00
Dandelion Mané	e3a4d1f2b9	ConnectionRow now shows the connection score (#658 ) Previously, the ConnectionRow showed the score of the node that was the source of the connection. I believe the UI will be more consistent and useful if it instead shows the connection score, i.e. how important that connection was to the node in scope. This combos well with PR #657. Test plan: The change is very simple, and covered by unit tests. I also verified the behavior by examining the cred explorer.	2018-08-14 15:40:49 -07:00
William Chargin	c890fe03b4	Add Node’s `path.normalize` for the browser (#650 ) Summary: This function normalizes paths like `foo/bar//../baz` to `foo/baz`. The implementation is directly copied from Node’s source code, which is available under an MIT License. I looked for a suitable NPM package, and rejected `path-normalize` and `normalize-path`. (The former is closer and explicitly purports to be the right thing, but actually isn’t.) Test Plan: Unit tests added, with full coverage except for one branch; I include a proof that the branch is unreachable. Tested on Node v8.9.4, Node v10.0.0, and Node v10.8.0. Tests pass in each case. In the latter two cases, inconsistencies between the implementation and the actual `path.posix.normalize` would cause a test failure. In the former case, they do not. (All such cases verified.) wchargin-branch: path-normalize	2018-08-14 13:04:53 -07:00
Dandelion Mané	bf65860178	Change the PagerankTable component cycle (#657 ) Currently, the PagerankTable creates components in the following pattern: ``` NodeRow (depth=0) > ConnectionRowList (depth=1) > ConnectionRow (depth=2) > ConnectionRowList (depth=3) ``` This commit changes the cycle to the following: ``` NodeRow (depth=0) > ConnectionRowList (depth=0) > ConnectionRow (depth=0) NodeRow (padding=true, depth=1) > ConnectionRowList (depth=1) > ConnectionRow (depth=1) ``` This has some nice properties: First, the context visually resets every time we return to a NodeRow, which makes it feasible to change the score column to always have a context dependent meaning: - for a node row, the score is the score of that node. - for a connection row, the score is the score contribution of that connection - (as of #502): for an aggregation row, the score is the score contribution of that aggregation We think this will be visually clear thanks to the padding around the new NodeRow, along with the new color block indicating a new scope. This design also ensures that every NodeRow has the full width available to it (rather than getting crushed into a progressively smaller area of the table), which will be very convenient for when we add renderers for the nodes. Thanks to @theliamcrawford as the idea for this came during a user study with him. Test plan: The updated unit tests should be comprehensive. Also, try expanding some rows in the cred explorer and verify that the behavior is as described.	2018-08-13 21:33:12 -07:00
Dandelion Mané	83969371d6	Tweak the cred explorer highlight color (#659 ) William and I were experimenting, and felt that this color is slightly more pleasing / harmonious with the rest of the site, and still quite legibile. Test plan: Examine the new UI, and conclude that the color choice is harmonious and legible :). Screenshot included with the PR. Paired with @wchargin	2018-08-13 21:27:21 -07:00
William Chargin	c315af4dbb	Annotate adapter set constructor return types (#661 ) Summary: Due to <https://github.com/facebook/flow/issues/6400>, patches like the following weren’t raising Flow errors: ```diff diff --git a/src/app/adapters/adapterSet.test.js b/src/app/adapters/adapterSet.test.js index 67dd3ed..ccc6ac6 100644 --- a/src/app/adapters/adapterSet.test.js +++ b/src/app/adapters/adapterSet.test.js @@ -77,6 +77,7 @@ describe("app/adapters/adapterSet", () => { const x = new TestStaticPluginAdapter(); const fallback = new FallbackStaticAdapter(); const sas = new StaticAdapterSet([x]); + sas.wat(); return {x, fallback, sas}; } it("errors if two plugins have the same name", () => { ``` A `flow type-at-pos` check indicated that the type of `sas` was indeed inferred as `any`. This patch applies the usual nonsensical fix. Better safe than spooky. Test Plan: The above patch now raises a Flow error. wchargin-branch: annotate-adapterset-constructors	2018-08-13 21:27:02 -07:00
William Chargin	4081e0d008	Use trailing slashes for all routes (#660 ) Summary: See justification in the added unit test. Test Plan: Added unit test, with justification. Also, `yarn sharness-full` passes, and `yarn start` still works. wchargin-branch: route-trailing-slashes	2018-08-13 21:25:20 -07:00
Dandelion Mané	080edb380d	TableRows can create vertical padding (#656 ) This commit adds the `showPadding` prop to `TableRow`s. If showPadding is true, then the row will have vertical padding above the row, and below the last child of the row. The padding will match the background color of the given row. The padding is implemented as extra `tr` elements that themselves contain empty tds. Test plan: The new behavior is pretty thoroughly covered by new unit tests. Additionally, if you want to see padding in the live UI, you can apply the following (slightly contrived) diff. ``` diff --git a/src/app/credExplorer/pagerankTable/Connection.js b/src/app/credExplorer/pagerankTable/Connection.js index 3a882cd..633525b 100644 --- a/src/app/credExplorer/pagerankTable/Connection.js +++ b/src/app/credExplorer/pagerankTable/Connection.js @@ -70,7 +70,7 @@ export class ConnectionRow extends React.PureComponent<ConnectionRowProps> { depth={depth} description={connectionView} connectionProportion={connectionProportion} - showPadding={false} + showPadding={depth % 3 === 0} score={sourceScore} > <ConnectionRowList ```	2018-08-13 19:11:19 -07:00
Dandelion Mané	7f18e389ea	Separate TableRow `depth` from `indent` (#655 ) Currently, as we expand nodes or connections in the PagerankTable, the rows both get more indented and attain a deeper color. Both of these behaviors are controlled by the `depth` parameter. We're going to switch the UI to a cyclic structure, where as you drill down, once you get back to a particular node, the indentation resets to base, but the color - which now indicates nested depth - continues to change. This commit sets that change up, by splitting the behvaior into two parameters: `depth`, which controls the color, and `indent`, which controls the indentation level. As a small additional tweak, the indentation formula is changed so that buttons are always indented by 5 pixels. This results in a cleaner display for nodes that have `depth>0` but `indent==0` (as the button doesn't look squahsed against the color boundary). Test plan: The change is very simple; inspecting the updated snapshots should be persuasive.	2018-08-13 18:07:13 -07:00
Dandelion Mané	c5e20f9400	Factor out TableRow as a reusable abstraction (#652 ) We currently have two components which create rows in our PagerankTable: the `NodeRow` and `ConnectionRow`. Work on #502 will result in the addition of a new one, the `AggregationRow`. It's time to stop duplicating logic (and testing) of the shared behavior for these rows, like depth-based styling, row expansion/collapse, etc. This commit pulls all the common logic to rendering rows into a single, thoroughly tested `TableRow` component. There is one observable change in the UI: when a connection percentage is not available (i.e. for NodeRows), we now leave the column empty rather than placing a dash in the column. I think this is visually cleaner. Test plan: Unit tests pass, and this part of the code base is thoroughly tested, so that's a pretty reliable indicator. I also poked around the live PagerankTable in the cred explorer, just to be safe.	2018-08-13 12:31:16 -07:00
Dandelion Mané	f1c5d3756d	Add `Trie.getLast` (#646 ) When using Tries, we often want the last matching entry for the given path, and to throw an error if one is not available. By adding this method to the API, we avoid a lot of unnecessary repetition in the code base. Test plan: Unit tests pass. As this touches the untested WeightConfig, I've also manually tested the weight config behavior.	2018-08-10 23:01:29 -07:00
William Chargin	00e2a67477	In client-side code, only import GraphQL types (#641 ) Summary: An `import *` was used for convenience, but this effects a value import in addition to a type import. By exploding the wildcard import to directly import the required types, we can shave off 2.3% of our post-gzip bundle size (131.82 KB to 128.74 KB). It’s unfortunate that we lose the namespacing, but c’est la vie. Test Plan: `yarn flow` suffices. wchargin-branch: explode-wildcard-type-import	2018-08-10 22:26:28 -07:00
Dandelion Mané	233bec4f5e	Re-disable the Git plugin (#645 ) Thanks to #642, it should now be safe to disable the Git plugin, reaping the benefits described in #628, without causing the cred explorer to crash (#631). Test plan: - `yarn travis --full` passes - The full cred explorer works: - Running PageRank does not crash the explorer - Expanding a pull request does not crash the explorer - (After clearing state) the weight config doesn't show Git weights - The filter doesn't show Git nodes	2018-08-10 19:44:02 -07:00
Dandelion Mané	86ce26acb8	Improve weight config (#644 ) This modifies WeightConfig to properly use the fallback node type, as created in #640 and merged in #642. As an additional change, it now displays type names, rather than the address parts. For example, the issue type is now displayed as `Issue`, not `["sourcecred", "github", "issue"]`. The WeightConfig is an untested mess, and I will likely re-write it entirely. (See a bevy of WeightConfig related issues: #604, #595, #588). So, not too much effort was invested in keeping high code quality in this commit. Test plan: The weight config has no tests, so I manually tested: - weights persist after page reload - node weights influence cred attribution - edge weights influence cred attribution - edge directionality influences cred attribution - weights have reasonably pretty description messages	2018-08-10 19:29:01 -07:00
Dandelion Mané	05c9f81cc0	Use AdapterSets in the cred explorer (#642 ) This takes the code from #640 and puts it into production. Test plan: Unit tests pass. The observable behavior in the cred explorer is unchanged; i.e. the addition of the FallbackAdapter did not produce new entries in the WeightConfig or in the Pagerank table options. The WeightConfig is untested, so we don't have verification of that behavior (other than that I tested it and am reporting it here). The PagerankTable code is tested, and a snapshot would fail if another option group had appeared.	2018-08-10 18:19:54 -07:00
Dandelion Mané	9edd7ac069	Add AdapterSet and FallbackAdapter (#640 ) Issue #631 revealed that our current plugin-handling code is fragile - we aren't robust to having nodes from a plugin without having that plugin present in the frontend. This commit adds `StaticAdapterSet` and `DynamicAdapterSet`, which are abstractions over finding the matching plugin adapter or type for a node or edge. It's a robust abstraction, because the adapter sets always include the `StaticFallbackAdapter` or `DynamicFallbackAdapter`, which can match any node, so we'll never get an error like #631 due to not having an adapter / type available. Also relevant: #465 Test plan: Unit tests included.	2018-08-10 18:08:49 -07:00
William Chargin	33763a9f35	Allow delay between GitHub queries (#626 ) Summary: This patch considers an environment variable `GITHUB_DELAY_MS`. If the value is set to a positive integer, a delay of the given number of milliseconds will be incurred before each query to GitHub. This is to decrease the probability of being rate-limited; see #350 for details. This in turn unblocks us to load SourceCred data for larger repositories. The use of an environment variable is something of a hack to get this off the ground. See #638 for long-term plans. Test Plan: Run `time node ./bin/sourcecred.js load sourcecred/example-github` with varying values for `GITHUB_DELAY_MS`. Note that with the variable unset, set to zero, set to a negative number, or set to a non-numeric value, the job completes quickly; when the delay is set to `5000`, the job takes an extra five seconds. wchargin-branch: delay-github-queries	2018-08-10 17:53:26 -07:00
Dandelion Mané	378f627a6f	Add `aggregateFlat` and `flattenAggregation` (#629 ) For #502: The UI that I currently have in mind displays aggregations grouped by connection type and node type together, rather than nested. I think it will be cumbersome to have multiple hierarchical levels of expansion. To make that UI easy to write, this commit adds some logic for flattening the hiearchical aggregation from #624. I add an extra translation to flatten, rather than just having the logic produce nested structures, because it's convenient to keep around the nested structure in case I decide to implement the hierarchical UI instead. Once we have solidified how we want the UI to behave, we might choose to simplify this code. Test plan: The implementation is rather simple. There are some unit tests.	2018-08-10 17:05:59 -07:00
Dandelion Mané	d8db763257	Move core plugin adapter code to its own directory (#632 ) This commit creates the directory `src/app/adapters` and moves the following three files into it: - `src/app/pluginAdapter.js` - `src/app/pluginAdapter.test.js` - `src/app/defaultPlugins.js` This is in preparation for a principled fix for #631, which will add a base plugin and some logic that ensures it's always included.	2018-08-10 17:05:44 -07:00
William Chargin	3eb2b6eec6	Add a favicon (#637 ) Summary: In addition to the obvious benefit of having a favicon, this gets rid of a 404 Not Found error on our home page, tremendously boosting our hacker cred. Test Plan: The favicon is displayed in both `yarn start` and the static site (as a result of the build script). The added build test fails before this change. wchargin-branch: add-favicon	2018-08-10 13:15:49 -07:00
William Chargin	bb12f933a5	Check in SourceCred logo files (#636 ) Summary: The SVG was artisanally crafted by yours truly, and rasterized by the accompanying script (which is fully deterministic, and also artisanal). Test Plan: Run `./src/assets/logo/rasterize.sh`, and note that the output is unchanged. wchargin-branch: add-logo	2018-08-10 13:01:36 -07:00
William Chargin	8f2d2cd5cd	Remove service workers entirely (#635 ) Summary: This is a follow-up to #514, wherein we disabled new service workers and instructed any existing service workers to self-destruct. (See that PR for the rationale.) This commit removes them from our codebase entirely, enabling us to slim down our build process and our build output. Test Plan: Running `yarn start` still works. Building the static site and exploring it works, too. wchargin-branch: remove-sw	2018-08-10 12:49:45 -07:00
William Chargin	10930c42af	Use `hydrate` instead of `render` on the client (#634 ) Summary: This fixes the following warning on our development instance: > Warning: render(): Calling ReactDOM.render() to hydrate > server-rendered markup will stop working in React v17. Replace the > ReactDOM.render() call with ReactDOM.hydrate() if you want React to > attach to the server HTML. We do in fact want to attach to the server HTML, so we apply the suggested patch. (The warning of course also applies to production, but warnings do not appear in production.) Test Plan: Running `yarn start` shows that the above warning has disappeared, and that the cred explorer still works. (Also, `yarn test --full` passes, but that tells us effectively nothing because this code path is never hit in tests: it only affects the HTML that is executed in the browser. Erasing the entire module, leaving only `// @flow`, still lets tests pass.) wchargin-branch: migrate-to-hydrate	2018-08-10 11:54:14 -07:00
Dandelion Mané	885ff90f62	Revert "Disable the Git plugin (#628 )" (#633 ) This reverts commit `8c70f03122`. Context: This introduced a serious bug (#631), so we're reverting it to get the codebase back in a working state. Meanwhile, I'll develop a principled solution. Test plan: I rebuilt the backend, re-loaded a graph, and loaded it in the frontend. PageRank, the cred explorer, and the weight config all work. Opening a pull request does not trigger a crash.	2018-08-10 11:40:20 -07:00
Dandelion Mané	bb59efbfe6	Implement core aggregation logic (#624 ) This implements two methods: `aggregateByNodeType` groups `scoredConnection`s by the specified `NodeTypes`, along with summary statistics. `aggregateByConnectionType` groups `scoredConnection`s by `ConnectionType` at the top level, where `ConnectionType` includes `EdgeType` and direction, (and also captures synthetic self-loops). Then it also groups by `NodeType` within any aggregation. This is progress towards #502. Test plan: unit tests included.	2018-08-09 23:13:46 -07:00
Dandelion Mané	8c70f03122	Disable the Git plugin (#628 ) See #627 for context. Removing the Git plugin results in an enormous performance improvement. In my testing on `metamask/metamask-extension`: before this change, load took 23s, and PageRank took >9 mins and then crashed. After this change, load+PageRank took 5s combined. Test plan: All unit tests pass; loading new data from the CLI works; and I poked around the UI to make sure there were no spurious references to the Git plugin. Note: This does not break backcompat, there's no need to regenerate any already-loaded data.	2018-08-09 23:08:01 -07:00
Dandelion Mané	51acc25f12	Add type signatures for aggregation logic (#623 ) This is the first real step towards #502. Factoring this out because deciding the type signatures was non-trivial, and the work was paired with @wchargin. Test plan: `yarn test`	2018-08-09 22:37:12 -07:00
Dandelion Mané	74e00b0bfd	Split PagerankTable into smaller files (#630 ) PagerankTable is getting a bit unwieldy, especially as #502 will need to add a new pair of components. This commit splits the erstwise PagerankTable.js into four files: - `pagerankTable/shared`, shared utils and types - `pagerankTable/Node`, the `NodeRow` and `NodeRowList` - `pagerankTable/Connection`, the `ConnectionRow`, `ConnectionRowList`, and `ConnectionView` - `pagerankTable/Table`, the `PagerankTable` itself This commit makes no logical changes; it is purely a reorganization. Test plan: `yarn test`	2018-08-09 21:33:40 -07:00

... 4 5 6 7 8 ...

987 Commits