diff --git a/README.md b/README.md index 18fc345a13405f20b4f3d6bfc3431a4f60a8da3d..2e1b8a9e1f6a03a68ba9ea2a95cca95101e54f7c 100644 --- a/README.md +++ b/README.md @@ -1,2 +1,16 @@ # higgs-data-analysis +Dataset from: http://snap.stanford.edu/data/higgs-twitter.html + +Scripts: + 1. r_script.r is used to gather network position information for all nodes + 2. create_user_data.py is used to gather all user id's and save it to a file + 3. analysis.py is used to gather the information when users first received the news about higgs boson + 4. merge_csv.py merges the interaction data with the network position data + +Results: + - users.txt : File with all user id's + - interactions.csv: Contains for all users the earliest_timestamp, tie_value and number of channels + - degree_follower.csv: Contains network position information + - data_analysis_results.csv : Merged network position with interactions. Final dataset +