a reputation protocol for open collaboration
Dandelion Mané 79017a477b
Add support for seed vectors to PagerankGraph (#1135)
This commit modifies `PagerankGraph.runPagerank` so that the user can
provide an alpha and seed vector. The seed vector is specified via a map
of weights, which will be normalized into a probability distribution
over all the nodes in the graph. In the event that the map is empty (or
the total weight is otherwise 0), a uniform distribution is created.

To effect this change, a helper function called `weightedDistribution`
has been added (and thoroughly tested) in the `graphToMarkovChain`
module. Then, that function is used in `pagerankGraph.runPagerank`
(along with light testing).
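
As a rough illustration of the normalization described above, here is a minimal sketch in plain JavaScript. It is not the code from the `graphToMarkovChain` module: the name `weightedDistributionSketch`, its signature (an array of node identifiers plus a `Map` of weights), the error handling, and the `Float64Array` return type are all assumptions made for this example.

```javascript
// Hypothetical sketch, not SourceCred's actual `weightedDistribution`.
// `nodeOrder`: array of node identifiers (assumed to be strings).
// `weights`: Map from node identifier to a non-negative, finite weight.
// Returns a Float64Array with one probability per entry of `nodeOrder`.
function weightedDistributionSketch(nodeOrder, weights) {
  const n = nodeOrder.length;
  if (n === 0) {
    return new Float64Array(0);
  }

  // Reject weights for nodes that are not in the graph, and invalid weights.
  const known = new Set(nodeOrder);
  for (const [node, weight] of weights) {
    if (!known.has(node)) {
      throw new Error(`weight given for unknown node: ${node}`);
    }
    if (!Number.isFinite(weight) || weight < 0) {
      throw new Error(`invalid weight for ${node}: ${weight}`);
    }
  }

  // Nodes missing from the map are treated as having weight 0.
  let total = 0;
  for (const node of nodeOrder) {
    total += weights.get(node) || 0;
  }

  const distribution = new Float64Array(n);
  for (let i = 0; i < n; i++) {
    distribution[i] =
      total === 0
        ? 1 / n // empty map or zero total weight: fall back to uniform
        : (weights.get(nodeOrder[i]) || 0) / total;
  }
  return distribution;
}

// Usage: node "a" gets weight 1 and node "c" gets weight 3.
const seeded = weightedDistributionSketch(
  ["a", "b", "c"],
  new Map([["a", 1], ["c", 3]])
);
// The entries of `seeded` are 0.25, 0 and 0.75.

// An empty map yields the uniform distribution (1/3 for each node here).
const uniform = weightedDistributionSketch(["a", "b", "c"], new Map());
```

Treating missing nodes as weight 0 keeps the map sparse, so a caller only has to list the nodes it wants to seed.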

Currently, the default alpha is set to 0, to ensure consistency with the
legacy pagerank implementation in `analysis/pagerank`. Once that has
been replaced with `PagerankGraph`, we can consider changing the default
alpha to non-zero (thus removing the need for synthetic self-loops).
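
To make the role of alpha concrete, the sketch below shows a single update step of a seeded PageRank iteration. It assumes the common convention that alpha is the probability of jumping to the seed distribution at each step; the commit message does not define alpha, so treat that reading as an assumption. The function name `pagerankStepSketch` and the dense row-stochastic matrix representation are also hypothetical and chosen only for this example, not taken from `PagerankGraph`.

```javascript
// Hypothetical sketch of one seeded PageRank step; not SourceCred's code.
// `chain[i][j]`: probability of moving from node i to node j (rows sum to 1).
// `pi`: current distribution over nodes.
// `seed`: seed distribution over nodes.
// `alpha`: assumed to be the probability of jumping to the seed.
function pagerankStepSketch(chain, pi, seed, alpha) {
  const n = pi.length;
  const next = new Float64Array(n);

  // With probability (1 - alpha), follow an edge of the Markov chain.
  for (let i = 0; i < n; i++) {
    for (let j = 0; j < n; j++) {
      next[j] += (1 - alpha) * pi[i] * chain[i][j];
    }
  }

  // With probability alpha, jump to a node drawn from the seed vector.
  for (let j = 0; j < n; j++) {
    next[j] += alpha * seed[j];
  }
  return next;
}

// Two nodes that always transition to each other.
const chain = [
  [0, 1],
  [1, 0],
];
const start = [1, 0];
const uniformSeed = [0.5, 0.5];

// alpha = 0: the seed has no effect, so all mass moves to the second node.
const withoutSeed = pagerankStepSketch(chain, start, uniformSeed, 0);

// alpha = 0.5: half of the mass follows the chain, half goes to the seed.
const withSeed = pagerankStepSketch(chain, start, uniformSeed, 0.5);
// The entries of `withSeed` are 0.25 and 0.75.
```

Under this reading, alpha = 0 reduces the step to plain multiplication by the Markov chain, which is why it stays consistent with an implementation that has no seed vector.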

I took a different approach in the [odyssey-hackathon repo][commit].
The previous approach was a much more complicated (and fairly redundant)
API, that allowed specifying "NO_SEED", "UNIFORM_SEED", "SELECTED_SEED",
and "SPECIFIED_SEED". I'm much happier with this API and implementation.

[commit]: ed07861073

Test plan: Unit tests included; run `yarn test`.
2019-05-05 18:57:41 +03:00
.circleci ci: run `test_full` on all merges to master (#976) 2018-11-01 10:57:53 -07:00
config Add `fetchGithubOrg` for loading organizations (#1117) 2019-03-19 19:00:08 -07:00
flow-typed/npm chore: update flow-typed libdefs (#932) 2018-10-22 10:05:11 -07:00
scripts Add sourcecred/{research,pm} to sourcecred.io (#1125) 2019-04-12 10:14:38 +02:00
sharness Automatically run pagerank on `sourcecred load` (#1115) 2019-04-11 21:21:29 +02:00
src Add support for seed vectors to PagerankGraph (#1135) 2019-05-05 18:57:41 +03:00
.eslintrc.js Remove obsolete eslint TODOs (#872) 2018-09-20 12:52:07 -07:00
.flowconfig Flow: enable `//$ExpectFlowError` (#315) 2018-05-29 13:56:36 -07:00
.gitignore Configure Webpack for backend applications (#84) 2018-03-18 22:43:23 -07:00
.mailmap meta: add .mailmap entry for Dandelion (#1108) 2019-02-26 15:46:06 +11:00
.prettierignore prettier: ignore sharness/ (#866) 2018-09-19 18:12:38 -07:00
.prettierrc.json Move package json to root (#37) 2018-02-26 22:32:23 -08:00
CHANGELOG.md Add `sourcecred pagerank` for backend pagerank (#1114) 2019-03-25 18:05:58 -07:00
CONTRIBUTING.md Update Contributing.md (#1107) 2019-02-23 08:37:32 -05:00
LICENSE license: relicense under MIT + Apache-2 (#896) 2018-09-26 19:28:41 -07:00
LICENSE-APACHE license: relicense under MIT + Apache-2 (#896) 2018-09-26 19:28:41 -07:00
LICENSE-MIT license: relicense under MIT + Apache-2 (#896) 2018-09-26 19:28:41 -07:00
README.md Fixes broken link README (#1122) 2019-03-26 16:01:39 -07:00
package.json Tests use yarn not npm (#1038) 2019-01-07 14:38:21 -08:00
yarn.lock deps: upgrade `flow-bin@^0.86.0` (#1002) 2018-11-09 09:24:40 -08:00

README.md

SourceCred


SourceCred creates reputation networks for open-source projects. Any open-source project can create its own cred, which is a reputational metric showing how much credit contributors deserve for helping the project. To compute cred, we organize a project's contributions into a graph, whose edges connect contributions to each other and to contributors. We then run PageRank on that graph.

To learn more about SourceCred's vision and values, please check out our website and our forum. One good forum post to start with is A Gentle Introduction to Cred.

For an example of SourceCred in action, you can see SourceCred's own prototype cred attribution.

Current Status

We have a prototype that can generate a cred attribution based on GitHub interactions (issues, pull requests, comments, references, etc.). We're working on adding more information to the prototype, such as tracking modifications to individual files, source-code analysis, GitHub reactions, and more.

Running the Prototype

If you'd like to try it out, you can run a local copy of SourceCred as follows. First, make sure that you have the following dependencies:

Then, run the following commands to clone and build SourceCred:

git clone https://github.com/sourcecred/sourcecred.git
cd sourcecred
yarn install
yarn backend
export SOURCECRED_GITHUB_TOKEN=YOUR_GITHUB_TOKEN
node bin/sourcecred.js load REPO_OWNER/REPO_NAME
# this loads sourcecred data for a particular repository
# it can take a few mins to run and will exit when finished
yarn start
# then navigate to localhost:8080 in your browser

For example, if you wanted to look at cred for ipfs/js-ipfs, you could run:

$ export SOURCECRED_GITHUB_TOKEN=0000000000000000000000000000000000000000
$ node bin/sourcecred.js load ipfs/js-ipfs

replacing the big string of zeros with your actual token.

You can also combine data from multiple repositories into a single graph. To do so, pass multiple repositories to the load command, and specify an “output name” for the repository. For instance, the invocation

node bin/sourcecred.js load ipfs/js-ipfs ipfs/go-ipfs --output ipfs/meta-ipfs

will create a graph called ipfs/meta-ipfs in the cred explorer, containing the combined contents of the js-ipfs and go-ipfs repositories.

Early Adopters

We're looking for projects that want to be early adopters of SourceCred! If you're a maintainer of an open-source project and would like to start using SourceCred, please reach out to us on our Discord or our forum.

Contributing

We'd love to accept your contributions! You can reach out to us by posting on our forum, or chatting with us on Discord. We'd be happy to help you get started and show you around the codebase. Please also take a look at our contributing guide.

If you're looking for a place to start, we've tagged some good first issues.

License

SourceCred is dual-licensed under Apache 2.0 and MIT terms:

Acknowledgements

We'd like to thank Protocol Labs for funding and supporting SourceCred. We'd also like to thank the many open-source communities that produced the software that SourceCred is built on, such as Git and Node.