Compatibility and Relationship Survival

1. Study Overview (The "Aggregate Score" Test)

This project tests the validity of the most common concept in pop-astrology: The Compatibility Score. Apps and websites often give users a percentage (e.g., "93% Match") based on summing up all the "good" and "bad" connections between two charts.

The Hypothesis: If astrological compatibility is cumulative, then couples with a higher "Total Harmonic Score" (Sum of planetary interactions) should have relationships that last longer (higher survival rate) than couples with a lower score.

2. Methodology: Large-Scale Survival Analysis

A. The Metric: Total Harmonic Score

We utilize a continuous mathematical model to quantify the total resonance between two birth charts using Cosine Similarity. $$ \text{Interaction} = \cos(\theta_{P1} - \theta_{P2}) $$

B. The Dataset: N = 2,722

Unlike our initial pilot study (N=166), this analysis uses a massive dataset of 2,722 couples scraped from Wikidata, providing high reliability for relationship start/end dates.

C. Survival Analysis

We perform Kaplan-Meier Survival Analysis, treating Divorce/Death as the "Event" and Ongoing relationships as "Censored."

3. Results Overview: The Null Hypothesis

The Result is Null. There is zero difference ($r \approx 0.009$) between the survival rates of highly compatible and incompatible couples when using an aggregate score.

4. Key Files

5. How to Run

# Run the survival analysis script
python analysis_survival.py

7. The Regression Formula

We ran an ElasticNet Regression to find the "Best Fit" equation. The result ($R^2=0.02$) confirmed that Personal Planets (Sun/Moon/Venus/Mars) have weighting coefficients of nearly zero. The only predictive factors were Generational (Age) Markers:

Duration = 18.5 + (0.57 * Pluto-Pluto) ... (Detailed formula in RESULTS.md)

Data Provenance

Relationship Compatibility