Hello,
I have a question. I have calculated differences between game genres. Each difference is a positive float: the bigger the number, the greater the difference between the two genres.
I want to visualise the differences, and I have the following code:
import json
import networkx as nx
import matplotlib.pyplot as plt

with open('genres_weights.json', 'r') as file:
    data = json.load(file)

G = nx.Graph()
max_diff = max(item['difference'] for item in data) if data else 1.0
for item in data:
    node1, node2 = item['weightsPair']
    difference = item['difference']
    weight = item['difference'] + 0.25
    G.add_edge(node1, node2, weight=weight, original_diff=difference)

plt.figure(figsize=(40, 20))
pos = nx.kamada_kawai_layout(G, weight='weight')
nx.draw_networkx_nodes(G, pos, node_size=2000, node_color='#2b83ba', alpha=0.9)
nx.draw_networkx_labels(G, pos, font_size=7, font_family='sans-serif')
plt.show()
that gives the following result for my data:

A lot of things look great, and overall the graph represents the data correctly (I guess). But there is one problem: in the bottom left part of the graph there are two bubbles, "immersive sim" and "rhythm". Judging by the layout, those two genres appear to be very similar (like some other pairs of genres that really are similar and have a very low difference), but in reality they are not. They have a difference of 9, which is a lot (the maximum difference between genres is around 14), so I expect them to be on opposite sides of the graph, not next to each other.
I'm not sure where the problem is. Can someone please help me?
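One thing that might be worth checking (a guess, not a diagnosis): when no `dist` argument is given, `nx.kamada_kawai_layout` takes its target distances from weighted shortest paths through the graph, not from each edge's own weight. If a chain of "similar" genres connects two genres more cheaply than their direct edge, the layout aims for the shorter path length. The sketch below looks for such pairs using only the standard library. The helper name `shortcut_pairs` and the toy data are hypothetical; the `weightsPair` and `difference` keys and the 0.25 offset are taken from the code above.

```python
from itertools import combinations


def shortcut_pairs(data, offset=0.25):
    """Return (a, b, edge_weight, path_length) for every pair whose
    shortest path through other nodes is shorter than its direct edge."""
    nodes = sorted({n for item in data for n in item["weightsPair"]})
    inf = float("inf")
    dist = {a: {b: (0.0 if a == b else inf) for b in nodes} for a in nodes}
    direct = {}
    for item in data:
        a, b = item["weightsPair"]
        w = item["difference"] + offset  # same weighting as in the post
        direct[frozenset((a, b))] = w
        dist[a][b] = dist[b][a] = min(dist[a][b], w)
    # Floyd-Warshall: all-pairs shortest paths over the weighted graph.
    for k in nodes:
        for i in nodes:
            for j in nodes:
                if dist[i][k] + dist[k][j] < dist[i][j]:
                    dist[i][j] = dist[i][k] + dist[k][j]
    found = []
    for a, b in combinations(nodes, 2):
        w = direct.get(frozenset((a, b)))
        if w is not None and dist[a][b] < w:
            found.append((a, b, w, dist[a][b]))
    return found


# Hypothetical toy data, NOT the real genres_weights.json.
toy = [
    {"weightsPair": ["immersive sim", "rhythm"], "difference": 9.0},
    {"weightsPair": ["immersive sim", "puzzle"], "difference": 1.0},
    {"weightsPair": ["puzzle", "rhythm"], "difference": 1.5},
]
for a, b, w, d in shortcut_pairs(toy):
    print(f"{a} - {b}: edge weight {w}, shortest path {d}")
```

If the real data produces such pairs, one option to try is building a two-level dictionary of the intended distances and passing it through the `dist` parameter of `nx.kamada_kawai_layout`. Whether that fully separates the two bubbles is not guaranteed, since a 2D layout can only approximate all pairwise distances at once.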
















