06-May-2024
https://www.youtube.com/@FLaNK-Stack
FLaNK / KNIFe AI / FLaNK-AIM Weekly
Tim Spann @PaaSDev
https://www.youtube.com/@FLaNK-Stack
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
*This is Issue #136 *
https://github.com/tspannhw/FLiPStackWeekly
https://www.cloudera.com/solutions/dim-developer.html
New Releases
Articles
https://medium.com/@tspann/small-language-models-sml-for-the-win-ea0c6fee8061
https://medium.com/@tspann/maybe-four-smaller-open-llm-s-are-better-than-one-93f78fb69eb9
https://medium.com/@tspann/building-a-milvus-connector-for-nifi-34372cb3c7fa
https://medium.com/@tspann/searching-slack-from-apache-nifi-9ed562aa2397
https://medium.com/@tspann/events-streams-flows-and-maps-22a8d27cd9b4
https://medium.com/@tspann/storing-meetup-user-data-as-events-dad3b1dc89f5
https://medium.com/@tspann/real-time-in-boston-part-1-0f92d7da3496
https://zilliz.com/learn/Sentence-Transformers-for-Long-Form-Text
https://zilliz.com/zilliz-cloud-pipelines
https://huggingface.co/BAAI/bge-large-en-v1.5
https://github.com/cloudevents/sdk-python/blob/main/samples/http-json-cloudevents/client.py
https://medium.com/@tspann/building-a-milvus-connector-for-nifi-34372cb3c7fa
https://hazelcast.com/glossary/streaming-data/
https://postgres.ai/blog/20220525-common-db-schema-change-mistakes
https://medium.com/cloudera-inc/consuming-rss-feeds-from-flink-sql-eaf33c1a5a23
https://medium.com/cloudera-inc/adding-generative-ai-results-to-sql-streams-513e1fd2a6af
https://blog.mozilla.ai/local-llm-as-judge-evaluation-with-lm-buddy-prometheus-and-llamafile/
https://blog.mozilla.ai/open-source-in-the-age-of-llms/
https://www.pinecone.io/learn/structured-data/
https://medium.com/@stoty/a-bug-for-ages-fixing-time-zone-handling-in-apache-phoenix-e9934d7acd80
https://www.geeknarrator.com/blog/stream-processing/stream-processing-concepts
https://blog.cloudera.com/setting-up-and-getting-started-with-clouderas-new-sql-ai-assistant/
https://thenewstack.io/how-to-cure-llm-weaknesses-with-vector-databases/
https://zilliz.com/blog/how-to-evaluate-and-optimize-performance-of-milvus-storage
https://datavolo.io/2024/05/apache-nifi-designed-for-extension-at-scale/
Videos
Generative AI with Milvus
https://www.youtube.com/watch?v=IfWIzKsoHnA
Four Models at Once
https://youtu.be/xvNgsZyfo6A?si=zxwc9VcFc3o0vU3P
Search Slack
https://www.youtube.com/watch?v=3ugppfb2kN8&t=5s&ab_channel=DatainMotion-HowToBeaStreamingEngineer
MBTA Transit Live with LLM
https://www.youtube.com/watch?v=JGGY_uzQTdY&t=3s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D
Events, Streams, Maps with Irish Rail
https://www.youtube.com/watch?v=14CSQRfUWoE&t=684s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D
FLaNK AI Channel
https://www.youtube.com/@FLaNK-Stack
NiFi
https://www.youtube.com/watch?v=m-ZoqHOYy_k
Slides
https://github.com/tspannhw/FLaNK-Milvus
https://medium.com/cloudera-inc/building-a-milvus-connector-for-nifi-34372cb3c7fa
https://www.youtube.com/watch?v=ssoM5S87BBs
Events
May 8-9, 2024: Data Summit 2024. Boston, MA.
https://www.dbta.com/DataSummit/2024/default.aspx
https://www.dbta.com/DataSummit/2024/Timothy-Spann.aspx
https://twitter.com/DBTADataSummit/status/1778393005646397636
May 21, 2024: Gen AI and Beyond with NiFi 2.0. Virtual.
May 30, 2024: Conf42: Machine learning
https://www.conf42.com/Machine_Learning_2024_Tim_Spann_enriching_generative_events
June 12, 2024: Budapest Data + ML Forum. Virtual.
https://budapestml.hu/2024/en/speakers/
June 20, 2024: AI Camp Meetup. NYC.
Sept 24, 2024: JConf.Dev. Dallas.
https://2024.jconf.dev/session/598816
Nov 5-7, 10-12, 2024: CloudX. Online/Santa Clara. https://www.developerweek.com/cloudx/
Nov 19, 2024: XtremePython. Online.
https://xtremepython.dev/2024/
Cloudera Events
https://www.cloudera.com/about/events.html
More Events:
https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe
Code
- https://github.com/tspannhw/FLaNK-python-processors
- https://github.com/tspannhw/FLANKAI-Boston
- https://github.com/tspannhw/FLaNK-IrelandTransit
Models
- https://github.com/ollama/ollama/releases/tag/v0.1.33-rc5
- https://github.com/vikhyat/moondream
- https://github.com/kermitt2/grobid
- https://huggingface.co/apple/OpenELM-3B-Instruct
- https://huggingface.co/namespace-Pt/Llama-3-8B-Instruct-80K-QLoRA
Tools
- https://krasjet.com/voice/pdf.tocgen/
- https://github.com/rcoh/angle-grinder
- https://www.brandur.org/logfmt
- https://iss.matteason.co.uk/
- https://echarts.apache.org/handbook/en/concepts/dataset
- https://www.chartjs.org/docs/latest/getting-started/usage.html
- https://js.cytoscape.org/
- https://github.com/apexcharts/apexcharts.js/
- https://github.com/andrewcourtice/ripl
- https://unovis.dev/gallery
- https://vega.github.io/vega-lite/
- https://observablehq.com/@observablehq/plot-gallery
- https://www.tremor.so/
- https://github.com/gyf304/dotenv
- https://wanix.sh/
- https://os-world.github.io/
- https://ollama.com/library/moondream
- https://forums.developer.nvidia.com/t/introducing-ollama-support-for-jetson-devices/289333
- https://www.youtube.com/@FLaNK-Stack
- https://github.com/kaytu-io/kaytu
- https://github.com/hashhar/jsonschema2sql
- https://threlte.xyz/
- https://github.com/kingjulio8238/memary
- https://zilliz.com/learn/how-to-enhance-the-performance-of-your-rag-pipeline
- https://medium.com/cloudera-inc/consuming-rss-feeds-from-flink-sql-eaf33c1a5a23
- https://huggingface.co/bigcode/starcoder2-15b-instruct-v0.1
- https://docs.pinecone.io/examples/notebooks
- https://github.com/JetBrains/xodus
- https://github.com/towhee-io/examples/tree/main/nlp/text_search
- https://github.com/openvinotoolkit/openvino_notebooks/tree/recipes/recipes
- https://github.com/zilliztech/kafka-connect-milvus
- https://github.com/timeplus-io/proton/tree/develop/examples/real-time-ai
- https://zilliz.com/what-is-gptcache
- https://github.com/zilliztech/VectorDBBench
- https://github.com/zilliztech/milvus_cli
- https://github.com/towhee-io/examples/blob/main/nlp/text_search/search_article_in_medium.ipynb
- https://seeed-studio.github.io/SenseCraft-Web-Toolkit/#/setup/process
- https://github.com/McGill-NLP/webllama
- https://github.com/Mozilla-Ocho/llamafile
- https://prometheus-eval.github.io/prometheus
- https://github.com/mozilla-ai/lm-buddy
- https://github.com/databonsai/databonsai
- https://github.com/cohere-ai/cohere-toolkit
- https://github.com/InternLM/lmdeploy
- https://github.com/TagStudioDev/TagStudio
- https://github.com/Axorax/tkforge
- https://github.com/hiyouga/LLaMA-Factory
- https://pub.towardsai.net/rag-2-0-finally-getting-rag-right-f74d0194a720
- https://github.com/brettdidonato/trendr-bot
- https://github.com/cloudera/CML_AMP_Intelligent-QA-Chatbot-with-NiFi-Pinecone-and-Llama2
- https://github.com/pydantic/logfire
- https://www.terminal.shop/
- https://github.com/penpot
- https://github.com/ytang07/ai_agents_cookbooks/tree/main/langchain
- https://voxel51.com/fiftyone/
- https://huggingface.co/apple/OpenELM-450M-Instruct?library=true
- https://zilliz.com/learn/effortless-ai-workflows-a-beginners-guide-to-hugging-face-and-pymilvus
- https://github.com/NVlabs/DoRA
- https://github.com/Blealtan/efficient-kan
- https://github.com/habuma/spring-ai-examples/tree/main/vector-store-loader
- https://github.com/eureka-research/DrEureka
- https://haystack.deepset.ai/integrations/milvus-document-store
- https://github.com/TencentARC/InstantMesh
- https://huggingface.co/NousResearch/Hermes-2-Pro-Llama-3-8B
- https://kai-waehner.medium.com/genai-demo-with-kafka-flink-langchain-and-openai-8c9b7263770a
- https://medium.com/@amanatulla1606/llm-web-scraping-with-scrapegraphai-a-breakthrough-in-data-extraction-d6596b282b4d
- https://news.mit.edu/2024/mit-faculty-instructors-students-experiment-generative-ai-teaching-learning-0429
- https://zilliz.com/blog/spring-ai-and-milvus-using-milvus-as-spring-ai-vector-store
Cool Tool
Convert Spark SQL to Trino SQL
https://github.com/linkedin/coral
Discount
Discount access to DataSummit 2024
https://secure.infotoday.com/RegForms/DataSummit/?Priority=24SPKR
Β© 2020-2024 Tim Spann https://www.youtube.com/@FLaNK-Stack
FLaNK-AIM with LLAMAΒ 3
Top comments (0)