The document presents a framework for feature selection (FS) adapted for big data environments using Apache Spark, demonstrating improved performance and accuracy compared to traditional methods. It highlights the challenges of processing high-dimensional datasets and showcases the framework's ability to efficiently handle large datasets by employing distributed programming techniques. Future research will focus on developing new FS methods for high-speed data streams, analyzing selection impacts on high-dimensional data, and creating an automatic FS system.